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Date: December 18. 1997 
Docket No . : 0020-4348P 



Assistant Commissioner for Patents 
Box PATENT APPLICATION 
Washington, D.C, 20231 

Sir: 

As authorized by the inventor(s), transmitted herewith for filing 
is a patent application applied for on behalf of the inventor (s) 
according to the provisions of 37 CFR 1,41 (c) . 



Inventor (s) : 



WATANABE, Ei j iro 
OEDA, Kenji 



For: RAFFINOSE SYNTHASE GENES AND THEIR USE 

Enclosed are: 
y /x A specification consisting of 72 pages 

3 sheet (s) of formal drawings 
Certified copy of Priority Document (s) 

Executed Declaration in accordance with 3 7 CFR 1.64 will follow 



A 



J 



A verified statement to establish small entity status under 37 
CFR 1.9 and 37 CFR 1.27 

Preliminary Amendment 

X Information Sheet 

jL_ Information Disclosure Statement, PTO-1449 with reference (s) 



Mael Address: P. O. Box 747, Falls Ghtjhoh, Vihgikia, USA 32040-0747 
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The filing fee has been calculated as shown below: 
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X The application transmitted herewith is filed in accordance 

with 37 CFR 1.41(c) . The undersigned has been authorized by the 
inventor (s) to file the present application. The original duly 
executed patent application together with the surcharge will be 
forwarded in due course. 

\ /x A check in the amount of $ 2 , 332 . 00 to cover the filing fee and 
recording fee (if applicable) is enclosed. 

The Government Filing Fee will be paid at the time of completion 

of the filing requirement . 

Please charge Deposit Account No. 02-2448 in the amount of 

$ . A triplicate copy of this transmittal form is 

enclosed. 

X Send Correspondence to: BIRCH, STEWART, KOLASCH & BIRCH, LLP 

P. 0. Box 747 

Palls Church, Virginia 22040-0747 
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No fee is enclosed. 

If necessary, the Commissioner is hereby authorized in this, 
concurrent, and future replies, to charge payment or credit any 
overpayment to Deposit Account No. 02-2448 for any additional 
fees required under 3 7 C.F.R. 1.16 or under 3 7 C.F.R. 1.17; 
particularly, extension of time fees. 



Respectfully submitted. 



BIRCH, STEWART, KOLASCH & BIRCH, LLP 




Falls Church, Virginia 22040-0747 



(703) 205-8000 
GMM/djm 
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IN THE U.S. PATENT AND TRADEMARK OFFICE 
INFORMATION SHEET 

Applicant: WATANABE, Ei j iro 

OEDA, Kenji 

Application No.: 

Filed: December 18, 1997 

For: RAFFINOSE SYNTHASE GENES AND THEIR USE 

Priority Claimed: 

COUNTRY DATE NUMBER 

Japan 12/18/96 8-338673 



Send Correspondence to: BIRCH, STEWART, KOLASCH & BIRCH, LLP 
P. 0. Box 747 

Falls Church, Virginia 22040-0747 
(703) 205-8000 

The above information is submitted to advise the USPTO of all 
relevant facts in connection with the present application. A timely- 
executed Declaration in accordance with 37 CFR 1.64 will follow. 

Respectfully submitted, 

BIRCH, STEWART, KOLASCH & BIRCH, LLP 
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^M //MURPHY, JR. 

28, 977 
'Box 747 
Falls Church, VA 22040-0747 



(703) 205-8000 
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IN THE U.S. PATENT AND TRADEMARK OFFICE 
Applicants: WATANABE et al 

Serial No.: New Group: 
Filed: December 18, 1997 Examiner: 

For: RAFFINOSE SYNTHASE GENES AND THEIR USE 

PRELIMINARY AMENDMENT 

Assistant Commissioner for Patents December 18, 1997 

Box PATENT APPLICATION 
Washington, D.C. 20231 

Sir: 

The following preliminary amendments and remarks are 
respectfully submitted in connection with the above-identified 
application . 



IN THE CLAIMS : 

Please amend the claims as follows: 



CLAIM 


21: 


Line 


2, 


delete 


", 2, 3, 4, 


7, 10, 


11, 14, 15 or 16" 


CLAIM 


22: 


Line 


2, 


delete 


", 6, 8, 9, 


12, 13, 


17 or 18" 


CLAIM 


24: 


Line 


3, 


change 


", 22 or 23" 


to to 


--or 22-- 


CLAIM 


25: 


Line 


3, 


change 


22 or 23" 


to to 


--or 22-- 


CLAIM 


26: 


Line 


3, 


change 


", 22 or 23" 


to to 


--or 22-- 


CLAIM 


27: 


Line 


3, 


change 


22 or 23" 


to to 


--or 22-- 


CLAIM 


28: 


Line 


3, 


delete 


", 25, 26 or 


27" 




CLAIM 


29: 


Line 


3, 


delete 


", 25, 26 or 


27" 




CLAIM 


30: 


Lines 1 


and 2, 


delete ", 2, 


3, 4, 


5, 6, 7, 8, 9, 10, 
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11, 12, 13, 14, 15, 16, 17, 18 or 29 



CLAIM 32: Lines 1 and 2, delete ", 2, 3, 4, 5, 6, 7, 8, 9, 10, 

11, 12, 13, 14, 15, 16, 17, 18, 29 or 30" 

CLAIM 36: Lines 2 and 3, delete 2, 3, 4, 5, 6, 7, 8, 9, 10, 

11, 12, 13, 14, 15, 16, 17, 18, 29 or 30" 



The above amendment to the claims merely corrects the 
improper multiple dependencies and places the application into 
better form prior to examination. 

Favorable action on the above -identified application is 
respectfully requested. 

If necessary, the Commissioner is hereby authorized in this, 
concurrent, and future replies, to charge payment or credit any 
overpayment to Deposit Account No. 02-2448 for any additional 
fees required under 37 C.F.R. §1.16 or under 37 C.F.R. §1.17; 
particularly, extension of time fees. 



* * * REMARKS * * * 



Respectfully submitted. 



BIRCH, STEWART, KOLASCH & BIRCH, LLP 




(703) 205-8000 



GMM/djm 
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RAFFINOSE SYNTHASE GENES AND THEIR USE 



FIELD OF INVENTION 
The present invention relates to raffinose synthase genes and their use. 



BACKGROUND OF THE INVENTION 
5 Raffinose faimly oligosaccharides are derivatives of sucrose, which are repre- 

sented by o-a-D-galactopyranosyl-(l->6) n-o-a-D-glucopyranosyl-(1^2)-P-D-flucto- 
furanoside as the general formula, and they are designated "raffinose" when n = 1, 
"stachyose" when n = 2, "verbascose" when n = 3, and "ajugose" when n = 4. 

The greatest contents of such raffinose family oligosaccharides are found in 
1 0 plants, except for sucrose, and it has been known that they are contained not only in 
higher plants including gymnosperms such as pinaceous plants (e.g., spruce) and 
angiosperms such as leguminous plants (e.g., soybean, kidney bean), brassicaceous 
plants (e.g., rape), chenopodiaceous plants (e.g., sugar beet), malvaceous plants (e.g., 
cotton), and salicaceous plants (e.g., poplar), but also in green algae, chlorella. Thus, 
1 5 they occur widely in the plant kingdom similarly to sucrose. 

Raffinose family oligosaccharides play a role as reserve sugars in the storage 
organs or seeds of many plants or as translocating sugars in the phenomenon of sugar 
transportation between the tissues of some plants. 

Furthermore, it has been known that raffinose family oligosaccharides have 
20 an effect of giving good conditions of enterobacterial flora, if present at a suitable amount 
in food. Therefore, raffinose family oligosaccharides have akeady been used as a 
functional food material for addition to some kinds of food and utilized in the field of 
specified healthful food. 

Raffinose family oligosaccharides having such a role and utility are produced 
25 by the raffinose oligosaccharide synthesis system beginning with sucrose in many plants. 
This biosynthesis system usually involves a reaction for the sequential addition of 
galactosyl groups from galactotinol through an a(l->6) bond to a hydroxyl group 



attached to the carbon atom at position 6 of a D-glucose residue in a sucrose molecule. 

In the first step of this biosynthesis system, raffinose synthase is an enzyme 
concerned in the reaction of raffinose production by combining a D-galactosyl group from 
galactotinol through an a(1^6) bond with a hydroxyl group attached to the carbon atom 
5 at position 6 of a D-glucose residue in a sucrose molecule. It has been suggested that this 
enzyme constitutes a rate limiting step in the above synthesis system, and it has been 
revealed that this enzyme is quite important in the control of biosynthesis of raffinose 
family oligosaccharides. 

The control of expression level or activity of raffinose synthase in plants 

1 0 makes it possible to change the contents of raffinose family oligosaccharides in these 
plants. How^ever, raffinose synthase, although the presence of this enzyme itself was 
already confirmed in many plants by the measurement of its activity with a biochemical 
technique, has not yet been successfully isolated and purified as a homogeneous protein. 
In addition, the amino acid sequence of this enzyme is stUl unknown, and no report has 

1 5 been made on an attempt at beginning to isolate a gene coding for this enzyme. 



SUMMARY OF THE INVENTION 
Under these circumstances, the present inventors have intensively studied and 
finally succeeded in isolating a raffinose synthase and a gene coding for this enzyme from 
broad bean, thereby completing the present invention. 
20 Thus, the present invention provides the following: 

1) A raffinose synthase gene isolated from a plant and having a nucleotide 
sequence coding for an amino acid sequence of a protein capable of producing raffinose 
by combining a D-galactosyl group through an a(l— »6) bond with a hydroxyl group 
attached to the carbon atom at position 6 of a D-glucose residue in a sucrose molecule. 
25 2) The raffinose synthase gene according to item 1, wherein the plant is a 

dicotyledon. 

3) The raffinose synthase gene according to item 2, wherein the dicotyle- 
2 



don is a leguminous plant. 

4) The raffinose synthase gene according to item 3, wherein the legumi- 
nous plant is broad bean. 

5) A raffinose synthase gene having a nucleotide sequence coding for 
5 protein (a) or (b) as defined below: 

(a) protein having the amino acid sequence of SEQ LD NO : 1 ; 

(b) protein having an amino acid sequence derived by deletion, replace- 
ment, modification or addition of one or several amino acids intfie aradno acid sequence of 
SEQ ID NO:l, and capable of producing raffinose by combining a D-galactosyl group 

1 0 through an a(l-^6) bond with a hydroxyl group attached to the carbon atom at position 6 
of a D-glucose residue in a sucrose molecule. 

6) A raffinose synthase gene having the nucleotide sequence of SEQ ID 

N0:2. 

7) The raffinose synthase gene according to item 3, wherein the legumi- 
1 5 nous plant is soybean. 

8) A raffinose synthase gene having a nucleotide sequence coding for 
protein (a) or (b) as defined below: 

(a) protein having the amino acid sequence of SEQ ID NO:3; 

(b) protein having an amino acid sequence derived by deletion, replace- 
20 ment, modification or addition of one or several amino acids in the amino add sequence of 

SEQ ID NO:3, and capable of producing raffinose by combining a D-galactosyl group 
through an a(l— >6) bond with a hydroxyl group attached to the carbon atom at position 6 
of a D-glucose residue in a sucrose molecule. 

9) A raffinose synthase gene having the nucleotide sequence of SEQ ID 

25 NO:4. 

10) The raffinose synthase gene according to item 2, wherein the dicotyle- 
don is a lamiaceous plant. 

11) The raffinose synthase gene according to item 10, wherein the lamia- 

3 



ceous plant is Japanese artichoke. 

12) A raffinose synthase gene having a nucleotide sequence coding for the 
amino acid sequence of SEQ ID N0:5. 

13) A raffmose synthase gene having the nucleotide sequence of SEQ ID 

5 NO:6. 

14) The raffinose synthase gene according to item 1, wherein the plant is a 
monocotyledon. 

15) The raffinose synthase gene according to item 14, wherein the mono- 
cotyledon is a gramineous plant. 

10 16) The raffinose synthase gene according to item 15, wherein the 

gramineous plant is com. 

17) A raffinose synthase gene having a nucleotide sequence coding for the 
amino acid sequence of SEQ ID N0:7. 

18) A raffinose synthase gene having the nucleotide sequence of SEQ ED 

15 NO:8. 

19) A raffinose synthase protein having amino acid sequence (a) or (b) as 
defined below: 

(a) amino acid sequence of SEQ ID NO: 1 or SEQ ID NO:3 ; 

(b) amino acid sequence derived by deletion, replacement, modification or 
20 addition of one or several amino acids in the amino acid sequence of SEQ ID NO: 1 or 

SEQIDNO:3; 

the protein being capable of producing raffinose by combining a D-galactosyl 
group through an a(l->6) bond with a hydroxyl group attached to the carbon atom at 
position 6 of a D-glucose residue in a sucrose molecule. 
25 20) A raffinose synthase protein having the amino acid sequence of SEQ ID 

NO:l orSEQIDNO:3. 

21) A gene fragment having a partial nucleotide sequence of the raffinose 
synthase gene of item 1, 2, 3, 4, 7, 10, 11, 14, 15 or 16. 



22) A gene fragment having a partial nucleotide sequence of the raffmose 
synthase gene of item 5, 6, 8, 9, 12, 13, 17 or 18. 

23) The gene fragment according to item 21 or 22, wherein the number of 
nucleotides is in the range of from 15 to 50. 

5 24) A method for the detection of a raffinose synthase gene or a gene 

fragment having a partial nucleotide sequence thereof, which comprises hybridizing a 
probe of the labeled gene fragment of item 21, 22 or 23 to an organism-derived genomic 
DNA or cDNA fragment; and detecting the DNA fragment bound specifically to the 
probe. 

10 25) A method for the detection of a raffinose synthase gene or a gene 

fragment having a partial nucleotide sequence thereof, which comprises hybridizing a 
probe of the labeled gene fragment of item 21, 22 or 23 to a plant-derived genomic DNA 
or cDNA fragment; and detecting the DNA fragment bound specifically to the probe. 

26) A method for the amplification of a raffinose synthase gene or a gene 
1 5 fragment having a partial nucleotide sequence thereof, which comprises annealing a 

primer having a nucleotide sequence of the gene fragment of item 21, 22 or 23 to 
organism-derived genomic DNA or cDNA; and amphfying the resulting DNA fragment 
by polymerase chain reaction. 

27) A method for the amplification of a raffinose synthase gene or a gene 
20 fragment having a partial nucleotide sequence thereof, which comprises annealing a 

primer having a nucleotide sequence of the gene fragment of item 21, 22 or 23 to plant- 
derived genomic DNA or cDNA; and amplifying the resulting DNA fragment by 
polymerase chain reaction. 

28) A method for obtaining a raffinose synthase gene, comprising the steps 
25 of identifying a DNA fragment containing a raffinose synthase gene or a gene fragment 

having a partial nucleotide sequence thereof by the method of item 24, 25, 26 or 27; and 
isolating and purifying the DNA fragment identified. 

29) A raffinose synthase gene obtained by identifying a DNA fragment 

5 



containing a raffinose synthase gene or a gene fragment having a partial nucleotide 
sequence thereof by the method of item 24, 25, 26 or 27; and isolating and purifying the 
DNA fragment identified. 

30) A chimera gene comprising the raffmose synthase gene of item 1, 2, 3, 
5 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18 or 29, and a promoter linked thereto. 

31) A transformant obtained by introducing the chimera gene of item 30 into 
a host organism. 

32) A plasmid comprising the raffinose synthase gene of item 1,2, 3, 4, 5, 
6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 29 or 30. 

1 0 33) A host organism transformed with the plasmid of item 32, or a cell 

thereof. 

34) A microorganism transformed with the plasmid of item 32. 

35) A plant transformed with the plasmid of item 32, or a cell thereof. 

36) A method for metabolic modification, which comprises introducing the 
15 raffinose synthase gene of item 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 

18, 29 or 30 into a host organism or a cell thereof, so that the content of raffinose family 
oligosaccharides in the host organism or the cell thereof is changed. 

37) A method for the production of a raffinose synthase protein, which 
comprises isolating and purifying a raffinose synthase protein from a culture obtained by 

20 cultivating the microorganism of item 34. 

38) An anti-raffmose synthase antibody capable of binding to the raffmose 
synthase protein of item 19 or 20. 

39) A method for the detection of a raffinose synthase protein, which 
comprises treating a test protein with the anti-raffinose synthase antibody of item 38; and 

25 detecting the raffinose synthase protein by antigen-antibody reaction between the antibody 
and the raffinose synthase protein. 
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BREF DESCRIPTION OF THE DRAWINGS 
Figure 1 shows the construction of plasmids used for the expression of a 
raffinose synthase gene in Escherichia coli. pBluescriptKS-RS is a plasmid containing 
the raffinose synthase gene cloned therein. RS represents the raffinose synthase gene, 

5 and the nucleotide sequences shown in the upper portion of this figure are those of both 
terminal portions of the raffinose synthase gene. A partial sequence represented by small 
letters is a nucleotide sequence derived from the vector pBluescriptll KS-. Two boxed 
nucleotide sequences are the initiation codon (ATG) and termination codon (TGATAA) of 
the raffinose synthase gene, respectively. The recognition sites for several restriction 

10 endonucleases are shown above the nucleotide sequences. pGEX-RS and pTrc-RS are 
plasmids used for the expression of the raffinose synthase gene in E. coli. Ptac, Ptrc, 
GST, lacl'5, and rmB represent tac promoter, trc promoter, glutathione-S -transferase 
gene, lactose repressor gene, and termination signal for the transcription of ribosomal 
RNA, respectively. 

1 5 Figure 2 shows the construction of expression vectors used for the expression 

in plants of chimera genes each having a raffinose synthase gene and a promoter linked 
thereto. The restriction endonuclease map of the raffinose synthase gene cloned in the 
plasmid pBluescriptKS-RS is shown in the lower portion of this figure. pBI221RS and 
pBI221(-)RS indicate the restriction endonuclease maps of expression vectors used for 

20 the transformation of soybean. 35S and NOS represent 35S promoter derived fi:om 
cauliflower mosaic virus and nopaline synthase gene terminator, respectively. 

Figure 3 shows the construction of expression vectors used for the expression 
in plants of chimera genes each having a raffinose synthase gene and a promoter linked 
thereto. The restriction endonuclease map of the raffinose synthase gene cloned in the 

25 plasmid pBluescriptKS-RS is shown in the upper portion of this figure. pBI121RS and 
pBI121(-)RS indicate the restriction endonuclease maps of binary vectors used for the 
transformation of mustard. For the binary vector, only a region between the right border 
and the left border is shown. 35S, NOS and NPT represent 35S promoter derived from 



cauliflower mosaic virus, nopaline synthase gene terminator and kanamycin resistance 
gene, respectively. 

DETAILED DESCRIPTION OF THE INVENTION 
The gene engineering methods described belov/ can be carried out according 
5 to ordinary methods, for example, as described in "Molecular Cloning: A Laboratory 
Manual 2nd edition" (1989), Cold Spring Harbor Laboratory Press, ISBN 0-87969- 
309-6; "Current Protocols In Molecular Biology" (1987), John Wiley & Sons, Inc. 
ISBN 0-471-50338-X; and "Current Protocols In Protein Science" (1995), John Wiley & 
Sons, Inc. ISBN 0-471-11184-8. 
1 0 The term "raffinose synthase gene" as used herein refers to a gene having a 

nucleotide . sequence coding for the amino acid sequence of a protein capable of producing 
raffmose by combining a D-galactosyl group through an a(1^6) bond with a hydroxyl 
group attached to the carbon atom at position 6 of a D-glucose residue in a sucrose 
molecule (hereinafter referred to simply as the present gene), and such a gene can be 
1 5 prepared, for example, from plants. 

More specifically, the present gene can be prepared from dicotyledons such as 
leguminous plants (e.g., broad bean, soybean) and lamiaceous plants (e.g., Japanese 
artichoke) or from monocotyledons such as gramineous plants (e.g., com). Specific 
examples of the present gene are a "raffinose synthase gene having a nucleotide sequence 
20 coding for a protein having the amino acid sequence of SEQ ID NO:!"; a "raffinose 
synthase gene having a nucleotide sequence coding for a protein having an amino acid 
sequence derived by deletion, replacement, modification or addition of one or several 
amino acids in the amino acid sequence of SEQ ID NO:l, and capable of producing 
raffinose by combining a D-galactosyl group through an a(1^6) bond with a hydroxyl 
25 group attached to the carbon atom at position 6 of a D-glucose residue in a sucrose 
molecule"; a "raffinose synthase gene having a nucleotide sequence coding for a protein 
having the amino acid sequence of SEQ ID NO:3"; a "raffinose synthase gene having a 
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nucleotide sequence coding for a protein having an amino acid sequence derived by 
deletion, replacement, modification or addition of one or several amino acids in the amino 
acid sequence of SEQ ID NO:3, and capable of producing raffmose by combining a D- 
galactosyl group through an a(l->6) bond with a hydroxyl group attached to the carbon 

5 atom at position 6 of a D-glucose residue in a sucrose molecule"; a "raffmose synthase 
gene having a nucleotide sequence coding for the amino acid sequence of SEQ ID N0:5"; 
and a "raffmose synthase gene having a nucleotide sequence coding for the amino acid 
sequence of SEQ ID N0:7." 

The present gene can be obtained, for example, by the following method. 

1 0 The tissues of a leguminous plant such as broad bean (Viciafaba) or soybean 

(Glycine max) are frozen in liquid nitrogen and ground physically with a mortar or other 
means into finely powdered tissue debris. From the tissue debris, RNA is extracted by an 
ordinary method. Commercially available RNA extraction kits can be utilized in the 
extraction. The whole RNA is separated from the RNA extract by ethanol precipitation. 

1 5 From the whole RNA separated, poly-A tailed RNA is fractionated by an ordinary 
method. Commercially available oligo-dT columns can be utilized in the fractionation. 
cDNA is synthesized from the fraction obtained (i.e., poly-A tailed RNA) by an ordinary 
method. Commercially available cDNA synthesis kits can be utilized in the synthesis. 

For example, cDNA fragments of the "raffmose synthase gene having a 

20 nucleotide sequence coding for a protein having the amino acid sequence of SEQ ID 
NO:!" as the present gene can be obtained by PCR amplification using the broad bean- 
derived cDNA obtained above as a template and primers 1 to 3 shown in list 1 below. 
The primers used therein can be designed and synthesized on the basis of the nucleotide 
sequence of SEQ ID NO:2, depending upon the purpose. For example, in order to 

25 amplify the open reading frame region of the "raffinose synthase gene having a nucleotide 
sequence coding for a protein having the amino acid sequence of SEQ ID NO:l," primers 
1 to 4 shown in list 2 below may be designed and synthesized. 

In the same manner, cDNA fragments of the "raffinose synthase gene having 
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a nucleotide sequence coding for a protein having the amino acid sequence of SEQ ID 
NO: 3" can be obtained by PGR amplification with the soybean-derived cDNA obtained 
above as a template and, for example, primers 4 to 6 shown in list 1 below. The primers 
used therein can be designed and synthesized on the basis of the nucleotide sequence of 
SEQ ID NO:4, depending upon the purpose. For example, in order to amphfy the open 
reading frame region of the "raffinose synthase gene having a nucleotide sequence coding 
for a protein having the amino acid sequence of SEQ ID NO:3," primers 5 to 8 shown in 
list 2 below may be designed and synthesized. 

The amplified DNA fragments can be subcloned according to ordinary 
methods, for example, as described in "Molecular Cloning: A Laboratory Manual 2nd 
edition" (1989), Cold Spring Harbor Laboratory Press; and "Current Protocols In 
Molecular Biology" (1987), John Wiley & Sons, Inc. ISBN 0-47 1-503 3 8 -X. More 
specifically, cloning can be effected, for example, using a TA cloning kit (Invitrogen) and 
a plasmid vector such as pBluescript 11 (Stratagene). The nucleotide sequences of the 
DNA fragments cloned can be determined by the dideoxy terminating method, for 
example, as described by F. Sanger, S. Nicklen, A.R. Coulson, Proceedings of National 
Academy of Science U.S.A. (1977), 74, pp. 5463-5467. For example, ABI PRISM Dye 
Terminator Cycle Sequencing Ready Reaction Kit commercially available from Perkin- 
Elmer may preferably be used. 

(List 1) 

Primer 1: aattttcaag catagccaag ttaaccacct 30mer 

Primer 2: gctcacaaga taatgatgtt agtc 24mer 

Primer 3: atacaagtga ggaacttgac ca 22mer 

Primer 4: ccaaaccata gcaaacctaa gcac 24mer 

Primer 5: acaacagaaa aatatgactc ttattact 28 mer 

Primer 6: aaaagagagt caaacatcat agtatc 26 mer 

(List 2) 

Primer 1: atggcaccac caagcataac caaaactgc 29 mer 

Primer 2: atggcaccac caagcataac caaaactgca accctccaag acg 43 mer 
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Primer 3: tcaaaataaa aactggacca aagac 25 mer 

Primer 4: tcaaaataaa aactggacca aagacaatgt 30 mer 

Primer 5: atggctccaa gcataagcaa aactg 25 mer 

Primer 6: atggctccaa gcataagcaa aactgtggaa ct 32 mer 

5 Primer?: tcaaaataaa aactcaacca ttgac 25 mer 

Primer 8: tcaaaataaa aactcaacca ttgacaattt tgaagcact 39 mer 



The term "gene fragment" as used herein refers to a gene fragment having a 
partial nucleotide sequence of the present gene (hereinafter referred to simply as the 
present gene fragment). For example, it may be a gene fragment derived from a plant and 

1 0 having a partial nucleotide sequence of the gene having a nucleotide sequence coding for a 
protein capable of producing raffmose by combining a D-galactosyl group through an 
a(l— >6) bond with a hydroxyl group attached to the carbon atom at position 6 of a 
D-glucose residue in a sucrose molecule. Specific examples of the present gene fragment 
are a gene fragment having a partial nucleotide sequence of the gene having a nucleotide 

15 sequence coding for the amino acid sequence of SEQ ID NO:l and a gene fragment 
having a partial nucleotide sequence of the gene having a nucleotide sequence of SEQ ID 
NO: 2, more specifically a gene fragment having a nucleotide sequence or a partial 
nucleotide sequence thereof, coding for any of the amino acid sequences shown in list 3 
below. 

20 These gene fragments can be used as probes in the hybridization method or as 

primers in the PGR method. For the primers in the PGR method, it is generally preferred 
that the number of nucleotides is greater from a viewpoint that the specificity of annealing 
is ensured; it is, however, also preferred that the number of nucleotides is not so great 
from viewpoints that the primers themselves are liable to have a higher structure giving 

25 possible deterioration of the annealing efficiency and that complicated procedures are 
required in the purification after the synthesis. In usual cases, preferred is a gene 
fragment consisting of single-stranded DNA, wherein the number of nucleotides is in the 
range of from 15 to 50. 
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(List 3) 
























#1 


Gly lie 




Phe 


Met 


Ser 


He Phe 


Arg 


Phe Lys 


Val 






Thr 






His Trp 


Val 


Gly 






















#2 


lie He 




Lys 


Phe 


Gly Trp Cys 


Thr 


Trp Asp 


Ala 


Phe 


Tyr 






#3 


Gly Gly 


Cys 


Pro 


Pro 


Gly Phe Val 


He 


He Asp 


Asp 


Gly 


Trp 






#4 


Thr Ser 


Ala 


Gly Glu 






Cys 


Arg Leu 


Val 


Lys 


Tyr 


Glu 


Glu 






























#5 


Val Tyr 


Val 


Trp 


His 






Gly 




Gly Gly 








#6 


Thr Met 


Glu 


Asp 


Leu 


Ala 


Val Asp 


LyI 


He Val 


Glu 


Asn 


Gly 


^1 


Gly 




Leu Val 


Pro 


Pro 






















#7 


Gly Leu 


His 


Ser 


His 


Leu 


Glu Ser 


Ala 


Gly He 


Asp 


Gly 




Lys 






Asp Val 


He 


His 


Leu 


Leu 


Glu 
















#8 


Gly Gly 




Val 


Glu 


Leu 


Ala Arg 


Ala 




Lys 


Ala 








#9 


Val Lys 




His 


Phe 


Lys 


Gly Asn 






Ala 










#10 


Gly Irg 


Cys 


Asn 


Asp 


Phe 


Phe Leu 




Gly Thr 


Glu Ala 


He 


Ser 


Leu 








Gly Asp 


Asp 


Phe Trp 


Cys 


Ser Asp 


Pro 


Ser 


Gly 


Asp 


Pro 




Asn Gly 


Thr 


Tyr 


Trp 


Leu 


Gin Gly 


Cys 




Val 


His 


Cys 






#11 






Ser 


Leu 


Trp Met Gly 






Gin 


Pro 


Asp 


Trp Asp 




Met Phe 


Gin 


Ser 


Thr 


His 


Pro Cys 




Gill Phe 


His 


Ala 


Ala 


Ser 


Arg 








Gly 


Gly 


Pro 


He Tyr 




Ser Asp 












#12 


Leu Pro 


Asp 


Gly 


Ser 


He 


Leu Arg 


Cys 














#13 


Ala Leu 


Pro 


Thr 


Arg 


Asp 


Cys Leu 


Phe 


Glu Asp 


Pro 


Leu 


His 


Asn Gly 




Lys Thr 


Met 


Leu 


Lys 


He 


Trp Asn 
















#14 


Gly Val 


Leu 


Gly Leu 


Phe 


Asn Cys 


Gin 


Gly Gly 


Gly Trp 








#15 


Phe Ala 


Pro 


He 


Gly 


Leu 


Val Asn 


Met 















The present gene fragment is labeled, and then used as a probe in the hybridi- 
zation method and hybridized to organism-derived DNA, so that aDNA fragment having 
the probe specifically bound thereto can be detected. Thus, from an organism-derived 
gene library, a raffmose synthase gene having a nucleotide sequence coding for the amino 
acid sequence of an enzyme capable of producing raffmose by combining a D-galactosyl 
group through an a(1^6) bond with a hydroxyl group attached to the carbon atom at 
position 6 of a D-glucose residue in a sucrose molecule; or a gene fragment having a 
paitial nucleotide sequence thereof, can be detected (hereinafter referred to simply as the 
present detection method). 



As the organism-derived DNA, for example, a cDNA library or a genomic 
DNA library of a desired plant can be used. The gene library may also be a commercially 
available gene library as such or a library prepared according to an ordinary library 
preparation method, for example, as described in "Molecular Cloning: A Laboratory 
5 Manual 2nd edition" (1989), Cold Spring Harbor Laboratory Press; "Current Protocols 
In Molecular Biology" (1987), John WHey & Sons, Inc. ISBN 0-471 -5033 8-X. 

As the hybridization method, plaque hybridization or colony hybridization can 
be employed, depending upon the kind of vector used in the preparation of a library. 
More specifically, when a library to be used is constructed with a phage vector, a suitable 

1 0 host microorganism is mixed with the phage under infectible conditions, which is further 
mixed with a soft agar medium, and the mixture is plated on an agar medium. Thereafter, 
a culture is grown at 37 °C until a plaque of an appropriate size appears. When a library to 
be used is constructed with a plasmid vector, the plasmid is transformed in a suitable host 
microorganism to form a transformant. The transformant obtained is diluted to a suitable 

15 concentration, and the dilution is plated on an agar medium, after which a culture is grown 
at 37°C until a colony of an appropriate size appears. 

In either case of the above Ubraries, a membrane filter is placed on the surface 
of the agar medium after the cultivation, so that the phage or transformant is transferred to 
the membrane. This membrane is denatured with an alkali, followed by neutralization, 

20 and for example, when a nylon membrane is used, the membrane is irradiated with 
ultraviolet light, so that DNA is fixed on the membrane. This membrane is then subjected 
to hybridization with the present gene fi-agment labeled by an ordinary method as a probe. 
For this method, reference may be made, for example, to D.M. Glover ed., "DNA 
cloning, a practical approach" IRL PRESS (1985), ISBN 0-947946-18-7. There are 

25 various reagents and temperature conditions to be used in the hybridization; for example, 
prehybridization is carried out by the addition of 6 x SSC (0.9 M NaCl, 0.09 M citric 
acid), 0.1-1% SDS, 100 |ig/ml denatured salmon sperm DNA, and incubation at 65°C for 
1 hour. The present gene fragment labeled is then added as a probe, and mixed. 
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Hybridization is carried out at 42-68°C for 4 to 16 hours. The membrane is washed with 
2 X SSC, 0.1-1% SDS, further rinsed with 0.2 x SSC, 0-0.1% SDS, and then dried. The 
membrane is analyzed, for example, by autoradiography or other techniques, to detect the 
position of the probe on the membrane and thereby detect the position of DNA having a 
5 nucleotide sequence homologous to that of the probe used. Thus, the present gene or the 
present gene fragment can be detected. The clone corresponding to the position of DNA 
thus detected on the membrane is identified on the original agar medium, and the positive 
clone is selected, so that the clone having the DNA can be isolated. The same procedures 
of detection are repeated to purify the clone having the DNA. 

10 Other detection methods can also be used, for example, GENE TRAPPER 

cDNA Positive Selection System Kit commercially available from Gibco BRL. In this 
method, a single- stranded DNA library is hybridized with the present gene fragment 
biotinylated (i.e., probe), followed by the addition of streptoavidin-bound magnet beads 
and mixing. From the mixture, the streptoavidin-bound magnetic beads are collected with 

15 a magnet, so that single-stranded DNA having a nucleotide sequence homologous to that 
of the probe used, which has been bound to these beads through the present gene 
fragment, biotin and streptoavidin, is collected and detected. Thus, the present gene or 
the present gene fragment can be detected. The single-stranded DNA collected can be 
changed into a double-strand form by treatment with a suitable DNA polymerase using a 

20 suitable oligonucleotide as a primer. 

The present detection method may also be used in the analysis of a plant. 
More specifically, plant genomic DNA is prepared according to an ordinary method, for 
example, as described in "Cloning and Sequence (Plant Biotechnology Experiment 
Manual)" complied under the supervision of Itaru Watanabe, edited by Masahiro Sugiura, 

25 published by Noson Bunka-sha, Tokyo (1989). The plant genomic DNA is digested with 
several kinds of suitable restriction endonucleases, followed by electrophoresis, and the 
electrophoresed DNA is blotted on a filter according to an ordinary method. This filter is 
subjected to hybridization with a probe prepared from the present gene fragment by an 



ordinary method, and DNA fragments to which the probe hybridizes are detected. The 
DNA fragments detected are compared in length between different varieties of a specified 
plant species. The differences in length make possible the analysis of differences in 
phenotypic characteristics accompanied with the expression of raffinose family oligo- 
5 saccharides between these varieties. Furthermore, when the DNA fragments detected by 
the above method are compared in length between the gene recombinant plant and the 
non-gene recombinant plant of the same variety, the former plant can be discriminated 
from the latter plant by the detection of hybridizing bands greater in number or higher in 
concentration for the former plant than for the latter plant. This mediod can be carried out 
1 0 according to the RFLP (restriction fragment length polymorphism) method, for example, 
as described in "Plant PGR Experiment Protocols" complied under the supervision of Ko 
Shimamoto and Takuji Sasaki, published by Shujun-sha, Tokyo (1995), ISBN 4-87962- 
144-7, pp. 90-94. 

The PGR method using a primer having the nucleotide sequence of the present 
15 gene fragment makes it possible to amplify from organism-derived DNA, a raffinose 
synthase gene having a nucleotide sequence coding for the amino acid sequence of an 
enzyme capable of producing raffinose by combining a D-galactosyl group through an 
a(l— >6) bond with a hydroxyl group attached to the carbon atom at position 6 of a 
D-glucose residue in a sucrose molecule; or a gene fragment having a partial nucleotide 
2 0 sequence thereof (hereinafter referred to simply as the present amplification method) . 

More specifically, for example, an oligonucleotide having 15 to 50 nucleo- 
tides in the nucleotide sequence of the present gene fragment at the 3 '-terminus is 
chemically synthesized by an ordinary synthesis method. Based on the codon table 
below, showing the correspondence of amino acids encoded in nucleotide s^uences, a 
25 mixed primer can also be synthesized so that a residue at a specified position in the primer 
is changed to a mixture of several bases, depending upon the variation of codons which 
can encode a certain amino acid. 
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CODON TABLE 



Phe 


uuu 


Ser 


ucu 


Tyr 


UAU 


Cys 


UGU 


uuc 


ucc 


UAC 


UGC 


Leu 


UUA 


UCA 


Stop 


UAA 


Stop 


UGA 


UUG 


UCG 


UAG 


Trp 


UGG 


CUU 


Pro 


ecu 


His 


CAU 


Arg 


CGU 


cue 


ecc 


CAC 


CGC 


CUA 


CCA 


Gin 


CAA 


CGA 


CUG 


ecG 


CAG 


CGG 


Ee 


AUU 


Thr 


ACU 


Asn 


AAU 


Ser 


AGU 


AUG 


ACC 


AAC 


AGC 


AUA 


ACA 


Lys 


AAA 


Arg 


AGA 


Met 


AUG 


ACG 


AAG 


AGG 


Val 


GUU 


Ala 


GCU 


Asp 


GAU 


Gly 


GGU 


GUC 


GCC 


GAC 


GGC 


GUA 


GCA 


Glu 


GAA 


GGA 


GUG 


GCG 


GAG 


GGG 



Furthermore, a base capable of forming a pair with plural kinds of bases, 
such as inosine, can also be used instead of the above mixture of several bases. More 
specifically, for example, primers having nucleotide sequences as shown in list 4 can be 
used. In this context, an oligonucleotide having the same nucleotide sequence as the 
5 coding strand of the present gene consisting of double-stranded DNA is designated "sense 
primer," and an oligonucleotide having a nucleotide sequence complementary to the 
coding strand, "antisense primer." 

A sense primer having the same nucleotide sequence as present on the 
5 '-upstream side in the coding strand of a raffinose synthase gene or a gene fragment 
1 0 having a partial nucleotide sequence thereof to be amplified, and an antisense primer 
having a nucleotide sequence complementary to the nucleotide sequence on the 
3' -downstream side in this coding strand, are used in combination for PCR reaction to 
amplify a DNA fragment, for example, with a gene library, genomic DNA or cDNA as a 
template. At this time, the amplification of a DNA fragment can be confirmed by an 
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ordinary method with electrophoresis. For the DNA fragment amplified, its restriction 
endonuclease map is constructed or its nucleotide sequence is determined by an ordinary 
method, so that the present gene or the present gene fragment can be identified. As the 
gene library used herein, for example, a cDNA library or a genomic cDNA Ubrary of a 
desired plant can be used. For the plant gene library, a commercially available library 
derived from plant can be used as such; or a library prepared according to an ordinary 
library preparation method, for example, as described in "Molecular Cloning: A Laborato- 
ry Manual 2nd edition" (1989), Cold Spring Harbor laboratory Press or "Current 
Protocols in Molecular Biology" (1987), John Wiley & Sons, Inc., ISBN 0-471-50338- 
X, can also be used. As the genomic DNA or cDNA used in the present ampUfication 
method, for example, cDNA or genomic cDNA prepared from a desired plant can be 
used. 

More specifically, for example, a primer designed on the amino acid sequence 
of SEQ ID N0:1 is used for the present amplification method with cDNA derived from 
Japanese artichoke, which is a lamiaceous plant as a template, so that a raffinose synthase 
gene fragment having the nucleotide sequence of SEQ ID NO: 6 can be amplified. 
Furthermore, for example, a primer designed on the amino acid sequence of SEQ ID 
NO:l is used for the present amplification method with cDNA derived from com, which 
is a gramineous plant, as a template, so that a raffinose synthase gene fragment having the 
nucleotide sequence of SEQ ID NO: 8 can be amplified. 

(List 4) 

1- F 32mer 

TT I AAIGT I TGGTGGAC I AC IC AITGGGTIGG 

2- F 41mer 

AT I AT IGAI AAI TT I GGI TGGTGI AC I TGGGAI GC I TTI TA 

2- RV 41mer 

TAIAAIGCITCCCAIGTICACCAICCIAAITTITCIATIAT 

3- F 44mer 

GGIGGITGICCICCIGGITTIGTIATIATIGAIGAIGGITGGCA 
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44mer 

TGCCAICCITCITCIATIATIACIAAICCIGGIGGICAICCICC 

32mer 

AAIAAICAITTIAAIGGIAAIGGIGTIATIGC 

32mer 

GCIATIACICCITTICCITTIAAITGITTITT 

38mer 

TGGATGGGIAAITTIATICAICCIGAITGGGAIATGTT 

38mer 

AACATITCCCAITCIGGITGIATIAAITTICCCATCCA 

27raer 

CATITTIACIA(AG) ICCIATIGGIGCIAA 

The present amplification method can also be utilized for the analysis of a 
plant gene. More specifically, for example, plant genomic DNA prepared from different 
1 5 varieties of a specified plant species is used as a template for the present amplification 
method to amplify a DNA fragment. The DNA fragment amplified is mixed with a 
solution of formaldehyde, which is denatured by heating at 85°C for 5 minutes, followed 
by rapid cooling on ice. This sample is subjected to electrophoresis, for example, on a 
6% polyacrylamide gel containing 0% or 10% glycerol. In this electrophoresis, a commer- 
20 ciaUy available electrophoresis apparatus for SSCP (single strand conformation polymor- 
phism) can be used, and electrophoresis is carried out, while the gel is kept at a constant 
temperature, e.g., 5°C, 25°C or 37°C. From the electrophoresed gel, a DNA fragment is 
detected, for example, by a method such as silver staining method with commercially 
available reagents. 

25 From the differences of behavior between the varieties in the electrophoresis 

of the DNA fragment detected, a mutation in the raffinose synthase gene is detected, and 
an analysis is carried out for differences caused by the mutation in phenotypic character- 
istics accompanied with the expression of raffinose family oligosaccharides. This method 
can be carried out according to the SSCP method, for example, as described in "Plant 

3 0 PCR Experiment Protocols" complied under the supervision of Ko Shimamoto and Takuji 



3- RV 

4- F 

5 4-RV 

5- F 

5- RV 

10 

6- RV 
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Sasaki, published by Shujun-sha, Tokyo (1995), ISBN 4-87962-144-7, pp. 141-146. 

The present detection method or the present amplification method as described 
above can also be used for identifying a raffinose synthase gene or a gene fragment 
having a partial nucleotide sequence thereof and then isolating and purifying the identified 
5 gene or gene fragment thereof to obtain the present gene (hereinafter referred to simply as 
the present gene acquisition method). 

The present gene or the present gene fragment can be obtained, for example, 
by detecting a probe consisting of the present gene fragment hybridized to DNA in the 
organism-derived gene library by the present detection method as described above to 

10 identify DNA having a nucleotide sequence homologous widi the probe used; purifying a 
clone carrying the DNA; and isolating and purifying plasmid or phage DNA from the 
clone. When the DNA thus obtained is a gene fragment having a partial nucleotide 
sequence of the raffinose synthase gene, further screening of the gene library by the 
present gene detection method using the DNA as a probe gives the present gene in full 

15 length. 

The present gene or the present gene fragment can be identified, for example, 
by effecting polymerase chain reaction using a primer having the nucleotide sequence of 
the present gene fragment to amplify a DNA fragment from the organism-derived DNA by 
the present amplification method as described above; and then constructing a restriction 

20 endonuclease map or determining a nucleotide sequence for the amplified DNA fragment. 
B ased on the nucleotide sequence of the gene fragment obtained, an antisense primer is 
synthesized for the analysis of 5 '-terminal sequences, and a sense primer is synthesized 
for the analysis of 3 '-terminal sequences. The nucleotide sequence of the present gene in 
full length can be determined by the RACE method using these primers and a commer- 

25 cially available kit, e.g., Marathon Kit of Clontech. The present gene in full length can be 
obtained by synthesizing new primers based on both terminal sequences in the nucleotide 
sequence thus determined and effecting polymerase chain reaction again. 

The present gene acquisition method as described above makes it possible to 
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obtain raffinose synthase genes as the present gene from various organisms. For 
example, a gene coding for a raffinose synthase having an amino acid sequence that has 
about 50% or higher homology, in the region corresponding to the length of 400 or more 
amino acids, with the amino acid sequence of SEQ ED NO:l, and capable of producing 
5 raffinose by combining a D-galactosyl group through an a(1^6) bond with a hydroxyl 
group attached to the carbon atom at position 6 of a D-glucose residue in a sucrose 
molecule. More specifically, for example, a raffinose synthase gene having the nucleotide 
sequence of SEQ ID NO:4 can be obtained by ampUfying and identifying a DNA fragment 
containing a gene fragment having a partial nucleotide sequence of the raffinose synthase 

1 0 gene by the present ampUfication method using primers designed from the amino acid 
sequence of SEQ ID NO:l and soybean cDNA as a template; isolating and purifying the 
identified DNA fragment, followed by the above procedures to obtain a full-length gene 
containing the DNA fragment. 

A chimera gene comprising the present gene and a promoter linked thereto 

1 5 (hereinafter referred to simply as the present chimera gene) can be constructed. 

The promoter to be used is not particularly limited, so long as it is function- 
able in a host organism to be transformed. The promoter may include, for example, 
synthetic promoters functionable in Escherichia coli, such as E. coli lactose operon 
promoter, E. coli tryptophan operon promoter and tac promoter; yeast alcohol dehydroge- 

20 nase gene (ADH) promoter, adenovirus major late (Ad.ML) promoter, SV40 early 
promoter, and baculovirus promoter. 

When the host organism is a plant or a cell thereof, the promoter may include, 
for example, T-DNA derived constitutive promoters such as nopaline synthase gene 
(NOS) promoter and octopine synthase gene (OCS) promoter; plant virus-derived 

25 promoters such as cauliflower mosaic virus (CaMV) derived 19S and 35S promoter; 
derived promoters such as phenylalanine ammonia-lyase (PAL) gene promoter, chalcone 
synthase (CHS) gene promoter and pathogenesis-related protein (PR) gene promoter. 
Furthermore, vector pSUM-GYl (see JP-A 06-189777/1994) can also be used, which 



has a promoter giving specific expression in a specified plant tissue, e.g., a promoter 
of soybean-derived seed storage protein glycinin gene. The use of a chimera gene 
constructed so as to have such a promoter makes it possible to increase or decrease the 
content of raffmose family oligosaccharides in a specified tissue of a plant. 
5 The present chimera gene is then introduced into a host organism according to 

an ordinary gene engineering method to give a transformant. If necessary, the present 
chimera gene may be used in the form of a plasmid, depending upon the transformation 
method for introducing the gene into the host organism. Furthermore, the present chimera 
gene may contain a terminator. In this case, it is generally preferred that the chimera gene 

10 is constructed so as to have a terminator downstream the raffinose synthase gene. The 
terminator to be used is not particularly limited, so long as it is functionable in a host 
organism to be transformed. For example, when the host organism is a plant or a cell 
thereof, the terminator may include, for example, T-DNA derived constitutive terminators 
such as nopaline synthase gene (NOS) terminator; and plant derived terminators such as 

1 5 terminators of allium virus GVl or GV2. 

If necessary, the present gene may be used in the form of a plasmid. For 
example, when the host organism is a microorganism, the plasmid constructed is 
introduced into the microorganism by an ordinary means, for example, as described in 
"Molecular Cloning: A Laboratory Manual 2nd edition" (1989), Cold Spring Harbor 

20 laboratory Press or "Current Protocols in Molecular Biology" (1987), John Wiley & 
Sons, Inc., ISBN 0-471-50338-X. The microorganism thus transformed is selected with 
a marker such as antibiotic resistance or auxotrophy. When the host organism is a plant, 
the plasmid constructed is introduced into a plant cell by an ordinary means such as 
infection with Agrobacterium (see JP-B 2-58917/1990 and JP-A 60-70080/1985), 

25 electroporation into protoplasts (see JP-A 60-251887/1985 and JP-B 5-68575/1993) or 
particle gun method (see JP-A 5-508316/1993 and JP-A 63-258525/1988). The plant cell 
transformed by the introduction of a plasmid is selected with an antibiotic such as 
kanamycin or hygromycin. From the plant cell thus transformed, a transformed plant can 
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be regenerated by an ordinary plant cell cultivation method, for example, as described in 
"Plant Gene Manipulation Manual (How to Produce Transgenic Plants)" written by 
Uchimiya, 1990, Kodan-sha Scientific (ISBN 4-06-153513-7), pp. 27-55. Furthermore, 
the collection of seeds from the transformed plant also makes it possible to prolify the 
5 transformed plant. In addition, crossing between the transformed plant obtained and the 
non-transformed plant makes it possible to produce progenic plants with the character of 
the transformed plant. 

The content of raffinose family oligosaccharides can be changed by introduc- 
ing the present gene into a host organism or a cell thereof, and modifying the metabolism 

10 in the host organism or the cell thereof. As such a method, for example, there can be used 
a method for metabolic modification to increase the amount of raffinose family oligo- 
saccharides in a host organism or a cell thereof by constructing the present chimera gene 
comprising the present gene and a promoter linked thereto, in which case the present gene 
is linked to the promoter in an original direction suitable for transcription, translation, and 

15 expression as a protein, and then introducing the present chimera gene into the host 
organism or the cell thereof; or a method for metabolic modification to decrease the 
amount of raffinose family oligosaccharides in a host organism or a cell thereof by 
constructing the present chimera gene comprising the present gene and a promoter linked 
thereto, in which case the present gene is linked to a promoter in a reverse direction 

20 unsuitable for translation and expression as a protein, and then introducing the present 
chimera gene into the host organism or the cell thereof. 

The term "raffinose synthase protein" as used herein refers to a protein 
encoded in the present gene (hereinafter referred to simply to the present protein). For 
example, it may include an enzyme protein having the amino acid sequence of SEQ ID 

25 N0:1 or SEQ ID NO: 3, or having an amino acid sequence derived by deletion, replace- 
ment, modification or addition of one or several amino acids in the amino acid sequence of 
SEQ ID NO:l or SEQ ID NO:3; and capable of producing raffinose by combining a 
D-galactosyl group through an a(l->6) bond with a hydroxyl group attached to the 



caibon atom at position 6 of a D-glucose residue in a sucrose molecule. 

Specific examples of the present protein are an enzyme protein having the 
amino acid sequence of SEQ ID N0:1 (799 amino acids; molecular weight, 89 kDa) and 
an enzyme protein having the amino acid of SEQ ID N0:3 (781 amino acids; molecular 
5 weight, 87 kDa). 

The present protein, although it can be prepared, for example, from legumi- 
nous plants such as broad bean (Viciafaba), by an ordinary biochemical method such as 
(NH4)2S04 precipitation, ion exchange column, hydrophobic column, hydroxy apatite 
column or gel filtration column, can also be prepared from the host organism transformed 

1 0 with the present plasraid, or a ceU thereof. More specifically, for example, using GST 
Gene Fusion Vectors Kit of Pharmacia, the present gene is inserted into an expression 
vector plasmid attached to the kit. The resulting vector plasmid is introduced into a 
microorganism such as E. coli according to an ordinary gene engineering method. A 
culture of the transformant obtained is grown on a medium with the addition of IPTG 

1 5 (isopropylthio-p-D-galactoside), so that the present protein can be expressed and derived 
as a fused protein in the culture. The fused protein expressed and induced can be isolated 
and purified by an ordinary method such as disruption of bacterial cells, colimm operation 
or SDS-PAGB electrophoresis. The digestion of the fiised protein with a protease such as 
thrombin or blood coagulation factor Xa gives the present protein. This may preferably 

20 be made, for example, according to the method described in "Current Protocols In Protein 
Science" (1995), John Wiley & Sons, Inc. ISBN 0-471-11184-8. The activity of the 
present protein can be measured, for example, by the method described in L. Lehle and 
W. Tanner, Eur. J. Biochem., 38, 103-1 10 (1973). 

An anti-raffinose synthase antibody capable of binding to a raffmose synthase 

25 protein (hereinafter referred to simply as the present antibody) can be produced by an 
ordinary immunological method using the present protein prepared above, as an antigen. 
More specifically, the present antibody can be produced, for example, according to the 
method described in Ed Harlow and David Lane, "Antibodies: A Laboratory Manual" 



(1988), Cold Spring Harbor Laboratory Press, ISBN 0-87969-314-2. 

The present protein can be detected by treating test proteins with the present 
antibody and detecting a protein having the present antibody bound specifically thereto. 
Such a detection method can be carried out according to an immunological technique such 
5 as Western blot method or enzyme-linked immunosorbent assay (ELISA), for example, as 
described in Ed Harlow and David Lane, "Antibodies: A Laboratory Manual" (1988), 
Cold Spring Harbor Laboratory Press. 

The Western blot method is carried out, for example, as follows: Proteins are 
extracted from a plant, for example, according to the method described in Methods in 

10 Enzymoiogy, volume 182, "Guide to Protein Purification," pp. 174-193, ISBN 9-12- 
1 82083-1 . The composition of an extraction buffer can suitably be changed depending 
upon the plant tissue used. The proteins extracted are electrophoresed according to an 
ordinary SDS-PAGE method. The proteins electrophoresed in the gel are transferred to a 
membrane by Western blotting with an ordinary electrical method. More specifically, for 

15 example, the gel is immersed in a transfer buffer (25 mM Tris, 192 mM glycine, 20% 
methanol) for 10 minutes, and then placed onto a PVDF membrane cut into the same size 
as that of the gel. The gel together with the membrane is set in a commercially available 
transfer apparatus of the semi-dry type. Blotting is carried out at a constant current of 
0.8 to 2 mA/cm^ for 45 minutes to 1 hour. The proteins transferred to the membrane can 

20 be detected immunologically with a kit for Western blot detection using a primary 
antibody, and a secondary antibody or protein A, which has been labeled with alkaline 
phosphatase or horseradish peroxidase. At this time, the present protein on the membrane 
can be detected by the use of the present antibody as a primary antibody. 

In the ELISA method, for example, the property of proteins binding to the 

25 surface of a 96- well ELISA plate made of a resin is utihzed in principle for the immuno- 
logical detection of an antigen finally bound to the surface of the ELISA plate. The test 
proteins are added as a solution and bound to an ELISA plate, followed by blocking, for 
example, by the addition of PBS containing a protein such as 5% bovine serum albumin. 



Thereafter, the well is washed with PBS, to which a solution containing the present 
antibody is added to effect the reaction. After the well is washed, a solution containing a 
secondary antibody labeled with alkaline phosphatase or horseradish peroxidase is further 
added to the well, followed by washing. Finally, a substrate solution for detection is 
5 added to the well, and the color development of the substrate is detected with an ELISA 
reader. 

In another method, the present antibody is added and bound to an ELISA 
plate, followed by blocking, for example, by the addition of PBS containing a protein 
such as 5% bovine serum albumin. The test proteins are then added as a solution, and an 

1 0 antigen contained in the test proteins is bound to the present antibody that has been bound 
to the plate, followed by washing, and the present antibody is further added to the well. 
The present antibody used at this time is preferably one prepared from an animal species 
different from that used for the preparation of the present antibody used first. A solution 
containing a secondary antibody labeled with alkaline phosphatase or horseradish peroxi- 

1 5 dase is then added to the well, followed by washing. The secondary antibody used at this 
time must have the property of binding to the present antibody added later. Finally, a 
substrate solution for detection is added, and the color development is detected with an 
ELISA reader. 

Examples 

20 The present invention will be further illustrated by the following examples; 

however, the present invention is not limited to these example in any way whatsoever. 
Example 1 (Purification of Galactinol) 

About 250 ml of sugar beet blackstrap molasses was five-fold diluted with 
methanol. The dilution was centrifiiged at 21,400 x g for 15 minutes at room temperature 
25 to remove insoluble matter. The supernatant obtained was transferred Into a 2-liter 
Erlenmeyer flask, to which isopropanol at a half volume was added portionwise with 
stirring. The flask was left at room temperatiu-e for a while until the resulting precipitate 
adhered to the wall of the flask. The supernatant was then discarded by decantation. To 



the precipitate was added 500 ml of ethanol, and the mixture was washed by stirring with 
a rotary shaker. The washing was further repeated several times. The washed precipitate 
was scraped off from the wall of the flask, followed by air drying on a filter paper. The 
air-dried precipitate (dry powder) was dissolved in purified water to about 40% (w/v). To 
5 this solution was added AG501-X8(D) of BioRad, followed by stirring. This operation 
was repeated until the color of the solution became almost unobserved. The resulting 
solution was treated with a Sep-Pak QMA column of Millipore, and further pretreated 
with Sep-Pak CM, Sep-Pak CI 8 and Sep-Pak Silica columns of Millipore, The resulting 
solution was loaded at a volume of 5 ml onto a column of Wako-gel LP40C18 (Wako 

10 Pure Chemical Industries, 2.6 cm x 85 cm), and eluted with purified water. The sugar 
content of the eluate was measured with a portable sugar refractometer, and the sugar 
composition was analyzed by high performance liquid chromatography (HPLC) with a 
Sugar-pak Na (7.8 mm x 300 irrai) column of Millipore. The detection of sugars was 
carried out with model 410 Differential Refractometer of Waters. The eluate containing 

1 5 galactinol was lyophilized, and the resulting lyophilized powder was dissolved in 5 ml of 
purified water. The solution was loaded onto a column of TOYOPEARL HW40(S) 
(Toso, 2.6 cm x 90 cm), and eluted with purified water. The eluate was analyzed in the 
same manner as described above, so that pvuified galactinol was obtained. 

The galactinol obtained was kept at 25 °C for 40 minutes in the reaction 

20 mixture that came to contain 80 mM phosphate buffer (pH 6.5), 2 mg/ml galactinol, and 
8.3 U a-galactosidase (Boehringer Mannheim, E. coli overproducer 662038). The 
reaction mixture was extiacted with chloroform, and the water layer was analyzed by 
HPLC. The resulting galactinol was confirmed to be hydrolyzed into galactose and 
myo-inositol. 

25 Example 2 (Measurement of Raffinose Synthase Activity) 

The raffinose synthase activity was measured under the following conditions 
according to the description of L. Lehle and W. Tanner, Eur. J. Biochem., 38, 103-110 
(1973). 



First, 2 of a sample to be used in the measurement of activity was added to 
1 8 ^1 of the reaction mixture that came to contain 100 mM Tris-HCl (pH 7.4), 5 mM DTT 
(dithiothreitol), 0.01% BSA, 200 jiM sucrose, 5 mM galactinol, 740 KBq/ml (31.7 jiM) 
[^^C] sucrose, and the reaction mixture was kept at 37°C for 3 to 20 hours. After the 
5 reaction, 30 )il of ethanol was added to the reaction mixture, followed by stirring and 
centrifugation at 15,000 rpm for 5 minutes. The supernatant was spotted at a volume of 
5 ^^1 on an HPTLC plate of cellulose for thin layer chromatography (Merck, 10 cm x 
20 cm), and developed with n-butanol : pyridine : water : acetic acid = 60 : 40 : 30 : 3. 
The developed plate was dried and then quantitatively analyzed with an imaging analyzer 
1 0 (Fuji Photographic Film, FUJIX Bio Imaging Analyzer B AS-2000II) for the determina- 
tion of [^'^C] raffinose produced. 

Example 3 (Purification of Raffinose Synthase) 

The purification of raffinose synthase from broad bean was carried out as 
follows: For each purified protein solution, proteins present in the protein solution were 

1 5 analyzed by SDS-PAGE (Daiichi Kagaku Yakuhin), and the enzyme activity thereof was 
measured according to the method described in Example 2. 

First, 300 g of immature seeds of broad bean (Nintoku Issun) stored at -80°C 
was thawed and then peeled. The peeled seeds were put in 600 ml of 100 mM Tris-HCl 
(pH 7.4), 5 mM DTT (dithiothreitol), 1 mM EDTA, 1 mM PMSF (phenyhnethylsulfonyl 

20 fluoride) and 1 mM benzamide, and ground on ice with a mortar. The ground material 
was centrifuged at 21,400 x g for 50 minutes at 4°C. To the resulting supernatant was 
added 10% polyethylene imine (pH 8.0) at a 1/20 volume. The mixture was stirred at 4°C 
for 15 minutes, and centrifuged at 15,700 x g for 20 minutes at 4°C. To the resulting 
supernatant was added 196 g/1 of (NH4)2S04 with stirring. The mixture was stirred in ice 

25 for 30 minutes, and centrifuged at 15,700 x g for 20 minutes at 4°C. To the resulting 
supernatant was further added 142 g/1 of (1^4)2804 with stirring. After the stirring in 
ice for 30 minutes, the mixture was centrifuged at 15,700 x g for 20 minutes at 4°C. The 
resulting precipitate was dissolved in 50 ml of 100 mM Tris-HCl (pH 7.4) and 5 mM 
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DTT (dithiothreitol), and the solution was dialyzed against 20 mM Tris-HCl (pH 7.4), 
1 mM DTT (dithiothreitol) and 1 mMEDTA at 4°C overnight. After the dialysis, the 
suspension was centrifuged at 70,000 x g for 60 minutes at 4°C. To the resulting 
supernatant was added 1 mM benzamidine • HCl, 5 mM e-amino-n-caproic acid, 1 )i.g/ral 
5 antipain, 1 |J,g/ml leupeptin and 10 mM EGTA, and 2 M KCl was further added 
portion wise at a 1/40 volume. The mixture was loaded onto a column of DEAE-Sephacel 
(Pharmacia, 2.5 cm x 21.5 cm) equilibrated with 0.05 M KCl, 20 mM Tris-HCl (pH 
7.4), 1 mM DTT (dithiothreitol) and 1 mM EDTA, and the adsorbed proteins were eluted 
with a gradient of 0.05 to 0.5 M KCL The purification steps up to this stage were 

1 0 repeated three times, and fractions having raffinose synthase activity were combined and 
then purified as follows: 

To the eluted fraction having raffinose synthase activity was added portion- 
wise saturated (NH4)2S04 at a 1/4 volume. The solution was loaded onto a column of 
Phenyl-Sepharose (Pharmacia, 2.5 cm x 10.2 cm) equilibrated with 20% saturated 

15 (NH4)2S04, 20 mM Tris-HCl (pH 7.4), 1 mM DTT (dithiothreitol) and 1 mMEDTA, 
and the adsorbed proteins were eluted with a gradient of 20% to 0% (NH4)2S04. The 
resulting active fraction was diluted by the addition of 0.01 M potassium phosphate buffer 
(pH 7.5) at a 2-fold volume. The diluted solution was loaded onto a column of Econo- 
Pac lODG (BioRad, 5 ml) previously equilibrated with 0.01 M potassium phosphate 

20 buffer (pH 7.5) and 2 mM DTT (dithiothreitol), and the adsorbed proteins were eluted 
with a gradient of 0.01 to 0.5 M potassium phosphate buffer (pH 7.5) and 2 mM DTT 
(dithiothreitol). The active fraction obtained at this stage was found to have been purified 
up to 6500-fold or higher specific activity. Part of the resulting purified protein solution 
having raffinose synthase activity was loaded onto a column of Superdex 200 (Pharmacia, 

25 1.6 cm X 60 cm) equilibrated with 0.2 M KCl, 20 mM Tris-HCl (pH 7.4), 1 mM DTT 
(dithiotiireitol) and 1 mM EDTA. The purified proteins thus separated were subjected to 
SDS-PAGE, and the raffinose synthase activity was measured. A protein band having 
raffinose synthase activity was identified as having a molecular weight of about 90 kDa on 
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the SDS-PAGE. 

Example 4 (Analysis of Partial Amino Acid Sequence of Raffinose Synthase) 
To about 1 ml of the purified protein solution, which had been purified with a 
column of Econo-Pac lODG (BioRad, 5 ml) in Example 3, was added 100% TCA at a 1/9 
5 volume, and the mixture was left on ice for 30 minutes. After centrifugation at 10,000 x g 
for 15 minutes, the resulting precipitate was suspended in 500 |il of cold acetone (-20°C), 
followed by further centrifugation. This acetone washing was repeated, and the collected 
precipitate was dried and then dissolved in 200 ^1 of SDS-sample buffer, followed by 
SDS-PAGE. CBB staining was effected for the electrophoresed gel, from which the band 

10 of a raffinose synthase protein was cut out. 

To the gel thus taken was added 1 ml of 50% acetonitrile and 0.2 M ammo- 
nium carbonate (pH 8.9), and washing was continued with stirring at room temperature 
for 20 minutes. The gel was washed once again in the same manner, and dried under 
reduced pressure to an extent giving a volume reduction. To this gel was 1 ml of 0.02% 

1 5 Tween-20 and 0.2 M ammonium carbonate (pH 8.9), and the mixture was stirred at room 
temperature for 15 minutes. After removal of the solution, 400 jil of 8 M urea and 0.4 M 
NH4HCO3 was added, to which 40 |il of 45 mM DTT (dithiothreitol) was further added, 
and the mixture was left at 50°C for 20 minutes. After complete return to room tempera- 
ture, 4 jil of 1 M iodoacetic acid was added, and the mixture was stirred in the dark at 

20 room temperature for 20 minutes. After removal of the solution, 1 ml of purified water 
was added, and the mixture was stirred at room temperature for 5 minutes, followed by 
washing. After further two washings, 1 ml of 50% acetonitrile and 0.2 M ammonium 
carbonate (pH 8.9) was added, and the mixture was stirred at room temperature for 
15 minutes. The same treatment was repeated once again, after which the solution was 

25 removed, and the gel was dried under reduced pressure to an extent giving a volume 
reduction. 

To this gel was added a solution of Achromobacter Protease I (Takara, 
Residue-specific Protease Kit) at a volume of 100 |il. Further added was 0.02% 



Tween-20 and 0.2 M ammonium carbonate (pH 8.9) to an extent that the gel was not 
exposed from the surface of the solution, and the mixture was left at 37 °C for 42 hours. 
Further added was 500 p.1 of 0.09% TFA and 70% acetonitrile, and the nuxture was 
stirred at room temperature for 30 minutes. The resulting mixture as contained in a 
5 sample tube was floated in an ultrasonic bath, followed by ultrasonic treatment 
(BRANSON, 60 W output power) for 5 minutes. The tube and contents thus treated were 
centrifuged, and the resulting extract was collected in another silicone-coated sample tube. 
On the other hand, 500 ^il of 0.09% TFA and 70% acetonitrile was added again to the 
precipitate, followed by repeated extraction in the same manner as described above. The 

1 0 resulting extracts were combined and then concentrated under reduced pressure to an 
extent giving a solution remained at a volume of 200 to 300 |il. To the concentrate was 
added 25 jil of 8 M urea and 0.4 M NH4HCO3, and the mixture was concentrated to an 
extent giving a solution remained at a volume of 100 |j.l or lower. The concentrate was 
brought to about 100 \x\ with purified water, and the mixture was filtered through a filter 

15 of Ultrafree C3 GV (Millipore). The filtrate obtained was then subjected to elution 
through a column of Aquapore BU-300 C-4 (2.1 mm x 300 mm) by a gradient of 0.1% 
TFA/2.1% to 68.6% acetonitrile. Absorbance at 215 nm was monitored to collect a 
fraction at a peak thereof. The sample collected was evaporated under reduced pressure to 
complete dryness, and then analyzed with a Protein Sequencer 473A of ABI to determine 

20 a partial amino acid sequence of a raffinose synthase. 

Example 5 (Preparation of cDNA) 

About 2 g of immature seeds of broad bean (Nintoku Issun) was frozen in 
liquid nitrogen and then ground with a mortar, to which 20 ml of Isogen (Nippon Gene) 
was added, and the mixture was further thoroughly ground. The ground material was 
25 transferred into a centrifugation tube, to which 4 ml of chloroform was added, and the 
mixture was stirred with a vortex mixer and then centrifuged at 6,500 x g for 10 minutes 
at 4°C. The water layer was collected, to which 10 ml of isopropanol was added, and the 
mixture was stirred and then centrifuged at 6,500 x g for 10 minutes at 4°C. The resulting 



precipitate was washed with 10 ml of 70% ethanol and then dissolved in 1 ml of elution 
buffer (10 mM Tris-HCl (pH 7.5), 1 mM EDTA, 0.1% SDS). The solution was left at 
60°C for 10 minutes and then centrifuged at 10,000 x g for 1 minute to remove insoluble 
matter. To the resulting supernatant was added an equivalent volume of Oligotex-dT30 
5 (Takara), and the mixture was stirred and then left at 65°C for 5 minutes. The mixture 
was placed on ice and then left for 3 minutes, to which 200 p.1 of 5 M NaCl was added, 
and the mixture was left at 37°C for 10 minutes. The mixture was then centrifuged at 
10,000 X g at 4°C for 3 minutes. The precipitate was collected and then suspended in 

1 ml of TE buffer, and the suspension was left at 65 °C for 5 minutes, which was placed 
10 on ice and then left for 3 minutes, followed by centrifugation at 10,000 x g for 3 minutes 

at 4°C to remove the precipitate. 

To the resulting supernatant were added 100 |il of 3 M sodium acetate and 

2 ml of ethanol, and RNA was ethanol precipitated and collected. The collected RNA was 
washed twice with 70% ethanol and then dissolved in 20 (il of sterilized water, which was 

1 5 used for the subsequent cDNA synthesis. The amount of RNA obtained was determined 
by the measurement of absorbance at 260 nm. 

For the cDNA synthesis. First Strand Synthesis Kit for RT-PCR (Amercham) 
and cDNA Synthesis Kit (Takara) were used, and all operations were made according to 
the protocol. 

20 Example 6 (Nucleotide sequence Analysis of Raffinose Synthase Gene from 

cDNA) 

Based on the amino acid sequence obtained in Example 4, mixed synthetic 
DNA primers having the nucleotide sequences shown in list 5 below were synthesized. 
The PGR method was carried out with Gene Amp PGR Systems 2400 and DNA Thermal 
25 Gycler Model 480 of Perkin-Elmer using Advantage KlenTaq cDNA Kit of Glontech. 
The polymerase chain reaction was effected with the above primers at 94°G for 1 minute, 
at 50 °G for 3 minutes, and at 72°G for 3 minutes, and this reaction was repeated forty 
times. As a result, the combinations of primers 8.2 and 13.3RV, primers 13.4 and 



10.3RV, and primers 7.4 and 10.3RV, having the nucleotide sequences shown in list 5 
below, gave an amplification of 1.2 kb, 0.5 kb, and 1.2 kb bands, respectively. These 
amplified DNA fragments were cloned with a TA cloning kit (Invitrogen), followed by 
sequence reaction using ABI PRISM Dye Terminator Cycle Sequencing Ready Reaction 
5 Kit of Perkin-Elmer and nucleotide sequence analysis with a 373S DNA sequencer of 
ABI. As a result, these DNA fragments were found to have a nucleotide sequence 
extending from base 813 to base 1915, base 1936 to base 2413, and base 1226 to base 
2413, respectively, in the nucleotide sequence of SEQ ID NO:2. Based on these 
nucleotide sequences, synthetic DNA primers having nucleotide sequences shown in list 6 
1 0 below were prepared, and the nucleotide sequences in both terminal regions of cDNA 
were analyzed with Marathon cDNA Amplification Kit of Clontech. As a result, the 
nucleotide sequence of SEQ ID NO:2 was finally determined. 

(List 5) 
#8.2 26mer 

15 AA(AG) AC(ATGC) GC (ATGC) CC(ATGC) AG(TC) AT(TCA) AT(TCA) GAG AA 

#13.4 20mer 

AA(AG) AT(TCA) TGG AA{TC) CT(ATGC) AAC AA 

#7.4 24mer 

AA(AG) GC{ATGC) AG (AG) GT(ATGC) GT (ATGC) GT (ATGC) CC(ATGC) AAG 

20 #13.3RV 21mer 

(TC)TT (AG)TT (ATGC) AG (AG)TT CCA (AGT)AT TTT 

#10.3RV 21mer 

(TC)TT (AG)TC (TC)TC (AG) TA (ATGC) AG (AG)AA TTT 

(List 6) 

25 RS-2RV 30mer 

GGCTGAGGTTCGGTTCATTCCTGAATCATC 

RS-7 30mer 

CCAAATGGTACATATTGGCTCCAAGGTTGT 

RS-8 30mer 

30 AAGAGTGTATCTGAATTTTCACGCGCGGTG 
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RS-9 30mer 

TGGTGCAATGGGAAAACTCCAATGAGCACC 

RS-10 30mer 

ATGAAGTGTTCTGATAGATTGAAAGTTTCG 

5 RS-11 30mer 

CAGTCTCTGGAGTTTGATGATAATGCAAGT 

Example 7 (Cloning of Raffinose Synthase Gene from Broad Bean cDNA) 
The primers designed from the amino acid sequence of SEQ ID N0:1, i.e., 
primers having nucleotide sequences shown in list 7 below, were synthesized. Using 

1 0 these primers and cDNA obtained in Example 5 as a template, a DNA fragment of the 
open reading frame region was amplified by PGR under the conditions described in 
Example 6. The amplified DNA fragment was digested with the restriction endonucleases 
Bam HI and Xba I whose recognition sequences were contained in the primers used. 
Using Ligation Kit (Takara), the DNA fragment thus digested was cloned in the plasmid 

15 pBluescriptll KS- (Stratagene) previously digested with Bam HI and Xba I. The 
nucleotide sequence of the cloned DNA fragment was confirmed with ABI PRISM Dye 
Terminator Cycle Sequencing Ready Reaction Kit of Perkin-Elmer. In the clone thus 
obtained, it was found that the base at position 1591 in the nucleotide sequence of SEQ ID 
NO:2 had been changed from thymine (T) to cytosine (G). This was, however, a 

20 nonsense mutation without a change of the amino acid; therefore, this clone was 
designated pBluescriptKS-RS, and used in the subsequent experiment. 

(List 7) 
RS-N 41mer 

CGCGGATCCACCATGGCACCACCAAGCATAACCAAAACTGC 

25 RS-C 37mer 

TGCTC TAGATTATC AAAATAAAAACTGGACCAAAGAC 

Example 8 (Expression of Broad Bean Raffinose Synthase Gene in E. coli) 
The plasmid pBluescriptKS-RS having the broad bean raffinose synthase 
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gene obtained in Example 7 was digested with Bam HI and Not I, and cloned in the 
plasmid pGEX-4T3 (Pharmacia) digested with Bam HI and Not I to give the plasmid 
pGEX-RS as shown in Figure 1. 

The plasmid pBluescriptKS-RS was digested with Nco I and Xba I, and 
5 cloned in the plasmid pTrc99A (Pharmacia) digested with Nco I and Xba I to give the 
plasmid pTrc-RS as shown in Figure 1. 

These plasmids were introduced into E. coli strain HBlOl, and the resulting 
transformants were used for the confirmation of raffinose synthase expression. Overnight 
cultures of the transformants were inoculated at a volume of 1 ml each into 100 ml of LB 

1 0 medium and incubated at 37°C for about 3 hours, followed by the addition of IPTG 
(isopropylthio-P-D-galactoside) to a final concentration of 1 mM and further incubation 
for 5 hours. The cultures were centrifuged at 21,400 x g for 10 minutes, and the bacterial 
cells were collected. The collected bacterial cells were stored at -80°C. To the frozen 
bacterial cells was added a 10-fold volume of 100 mM Tris-HCl (pH 7.4), 1 mMEDTA, 

1 5 5 mM DTT (dithiothreitol), 1 mM PMSF (phenyhnethylsulfonyl fluoride) and 1 mM 
benzamide, and the bacterial cells were thawed and suspended. These suspensions were 
treated with an ultrasonic disrupter (Branson) to effect the disruption of the bacterial cells. 
The disrupted cell mixtures obtained were centrifuged at 16,000 x g for 10 minutes, and 
soluble protein solutions were collected. 

20 The protein solutions thus obtained were used at a volume of 4 |il each for the 

measurement of raffinose synthase activity according to the method described above. The 
reaction was effected at 37°C for 64 hours. As a control, E. coli strain HBlOl that had 
been transformed with one of the vectors, pGEX-4T3, was used. The results are shown 
in Table 1. The synthesis of raffinose was detected in the samples from the transformants 

25 HB 101 (pGEX-RS) and HB 101 (pTrc-RS). 
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TABLE 1 



Transformant 


Amount of 
raffinose produced 
(pmol) 


HBlOl (pGEX4T-3) 


0.56 


HE 101 (pGEX-RS) 


10.50 


HBlOl (pTrc-RS) 


11.10 



Example 9 (Cloning of Raffinose Synthase Gene from Soybean cDNA) 
In the same manner as described in Example 5, cDNA was obtained jfrom 
immature seeds of soybean (Glycine max) Williams 82. Using this cDNA as a template 
and primers designed from the amino acid sequence of SEQ ID NO:l, i.e., primers 
5 having nucleotide sequences shown in list 8 below, a DNA fragment was amplified by 
PGR under the conditions described in Example 6. The DNA fi:agment thus amplified by 
PGR was cloned with a TA cloning kit (Invitrogen), followed by sequence reaction using 
ABI PRISM Dye Terminator Cycle Sequencing Ready Reaction Kit of Perkin-Elmer and 
nucleotide sequence analysis with a 373S DNA sequencer of ABI. Based on this 

1 0 sequence, primers having nucleotide sequences shown in list 9 below were synthesized. 
The synthesis of cDNA was carried out with Marathon Kit of Clontech using mRNA 
obtained in the same manner as described in Example 5 from leaves of soybean Williams 
82. The cDNA obtained was ligated to an adaptor contained in this kit with ligase. This 
operation was made according to the protocol attached. Using the adaptor-hgated cDNA 

1 5 thus prepared, polymerase chain reaction was effected with the primers shown in Ust 9 
below. The nucleotide sequences in both terminal regions of the gene were analyzed 
according to the protocol attached to the Marathon Kit of Clontech. As a result, the 
nucleotide sequence of SEQ ID N0:4 was determined. 

(List 8) 

20 1-F primer 35mer 

CGATTIAAIGTITGGTGGACIACICAITGGGTIGG 
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2-RV primer 45mer 

GGCCTAIAAIGCITCCCAIGTICACCAICCIAAITTITCIATIAT 

5-F primer 41mer 

CGATGGATGGGIAAITTIATICAICCIGAITGGGAIATGTT 

5 6-RV primer 32mer 

GGCCACATITTIACIA(AG) ICCIATIGGIGCIAA 

(List 9) 
SN-1 SOmer 

CACGAACTGGGGCACGAGACACAGATGATG 

10 SC-3RV 30mer 

AAGCAAGTCACGGAGTGTGAATAGTCAGAG 

SC-5 SOmer 

ACACGAGACTGTTTGTTTGAAGACCCCTTG 

SC-6 25mer 

1 5 TGGAATCTCAACAAATATACAGGTG 

SN-3RV 30mer 

GGGTCATGGCCAACGTGGACGTATAAGCAC 

SN-4RV 30mer 

GATGATCACTGGCGCGGTTTTCTCCTCGAG 

20 Example 10 (Acquisition of Raffinose Synthase Gene from Japanese 

Artichoke cDNA) 

In the same manner as described in Example 5, cDNA was obtained from 
leaves of Japanese artichoke {Stachys sieboldii). Using this cDNA as a template and 
primers designed from the amino acid sequence of SEQ ID NO:l, i.e., primers having 

25 nucleotide sequences shown in list 10 below, a DNA fragment was amplified by PGR 
under the conditions described in Example 6. The DNA fragment thus amplified by PGR 
was cloned with a TA cloning kit (Invitrogen), followed by sequence reaction using ABI 
PRISM Dye Terminator Gycle Sequencing Ready Reaction Kit of Perkin-Elmer and 
nucleotide sequence analysis with a 373S DNA sequencer of ABI. As a result, the 

3 0 nucleotide sequence of SEQ ID:6 was determined. 

Based on the nucleotide sequence thus obtained, synthesized DNA primers 
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are prepared, and in the same manner as described in Example 9, the nucleotide sequences 
in both terminal regions of the gene are analyzed with Marathon Kit of Clontech. 



(List 10) 

1- F primer 35mer 

5 CGATT I AAI GT I TGGTGGAC I AC IC AITGGGT IGG 

4-RV primer 37mer 

GGCC AGC lATIACICCITTICCI TTI AAI TGI TTITT 

2- F primer 44mer 

CGAATIATIGAIAAITTIGGITGGTGIACITGGGAIGCITTITA 

10 6-RV primer 32mer 

GGCCACATITTIACIA (AG) ICCIATIGGIGCIAA 

Example 1 1 (Acquisition of Raffinose Synthase Gene from Com cDNA) 
In the same manner as described in Example 5, cDNA was obtained from 
leaves of com (Zea mays L.) Pioneer 3358. Using this cDNA as a template and primers 

1 5 designed from the amino acid sequence of SEQ ID NO:l, i.e., primers having nucleotide 
sequences shown in list 1 1 below, a DNA fragment was amplified by PCR under the 
conditions described in Example 6. The DNA fragment thus amplified by PCR was 
cloned with a TA cloning kit (Invitrogen), followed by sequence reaction using ABI 
PRISM Dye Terminator Cycle Sequencing Ready Reaction Kit of Perkin-Elmer and 

20 nucleotide sequence analysis with a 373S DNA sequencer of ABI. Based on this 
sequence, primers having nucleotide sequences shown in list 12 below were synthesized. 
In the same manner as described in Example 5, mRNA obtained from leaves of com {Zea 
mays L.) Pioneer 3358 was linked to an adaptor contained in the Marathon Kit of 
Clontech with ligase. This operation was made according to the protocol attached. Using 

25 the adaptor-ligated cDNA thus prepared, polymerase chain reaction was effected in the 
same manner as described above with the primers shown in list 12 below. As a result, the 
nucleotide sequence of SEQ ID N0:8 was determined. 

Based on the nucleotide sequence thus obtained, synthesized DNA primers 
are prepared, and in the same manner as described in Example 9, the nucleotide sequence 



in the 5 '-terminal region of the gene is analyzed with Marathon Kit of Clontech. 



(List 11) 
5-F primer 41mer 

CGATGGATGGGIAAITTIATICAICCIGAITGGGAIATGTT 

5 6-RV primer 32mer 

GGCC ACATITTI ACI A ( AG ) ICC lATIGGIGC lAA 



(List 12) 
M- 1 0 primer 25mer 

GACGTCGAGTGGAAGAGCGGCAAGG 

10 M- 11 primer 25mer 

C ACCTAC GAGCTC TTC GTCGTTGC C 



Example 12 (Construction of Expression Vectors in Plant for Chimera Gene, 
35S-Broad Bean Raffmose Synthase Gene) 

The plasmid pBluescriptKS-RS having the broad bean raffmose synthase 
1 5 gene obtained in Example 7 was digested with the restriction endonucleases Bam HI and 
Sac I. Using Ligation Kit (Takara), the DNA fragment thus digested was cloned in the 
binary vector pBI121 (Clontech) previously digested with Bam HI and Sac I. The vector 
thus obtained was designated pBI121-RS. 

For an antisense experiment, plasmid pBI121 (Clontech) previously digested 
20 with Bam HI and Sac I was ligated to linkers shown in list 13 below to give pBI121(-). 
This pBI121(-) was used to prepare pBI121(-)-RS in the same manner as described for 
the preparation of pBI121-RS above. 

A similar vector was prepared with pBI221 . The plasmid pBluescriptKS-RS 
obtained in Example 7 was digested with the restriction endonucleases Bam HI and Sac I. 
25 Using Ligation Kit (Takara), the DNA fragment thus digested was cloned in the vector 
pBI221 (Clontech) previously digested with Bam HI and Sac I. The vector thus obtained 
was designated pB 122 1 -RS . 

For an antisense experiment, plasmid pBI221 (Clontech) previously digested 
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with Bam HI and Sac I was ligated to linkers shown in list 13 below to give pBI221(-). 
This pBI221(-) was used to prepare pBI221(-)-RS in the same manner as described for 
the preparation of pBI221-RS above. 

The construction of these expression vectors is shown in Figures 2 and 3. 

(List 13) 
BamSac-(+) linker 25mer 

GATCGAGCTCGTGTCGGATCCAGCT 

BamSac-(-) linker 17mer 

GGATCCGACACGAGCTC 

Example 13 (Transformation of Mustard with Broad Bean Raffinose 
Synthase Gene) 

The vectors pBI121-RS and pBI121(-)-RS prepared in Example 12 were 
used for the transformation of mustard (Brassica juncia) by the Agrobacterium infection 
method. 

Agrobacterium tumefaciens (strain C58C1, rifampicin resistant) previously 
made into a competent state by calcium chloride treatment was transformed independently 
with two plasraids pBI121-RS and pBI121(-)-RS prepared m Example 12. Selection for 
transformants was carried out on LB medium containing 50 |ig/ml rifampicin and 
25 |ig/ml kanamycin by utilizing the character of kanaraycin resistance conferred by the 
kanamycin resistance gene (neomycin phosphotransferase, NPTII) of the introduced 
plasmids. 

The transformant Agrobacterium obtained {Agrobacterium tumefaciens strain 
C58, rifampicin resistant) was cultivated on LB medium containing 50 |ig/ml rifampicin 
and 25 jig/ml kanamycin at 28 °C for a whole day and night, and the culture was used for 
the transformation of mustard by the method described below. 

The seeds of mustard were aseptically sowed on 1/2 MS medium, 2% 
sucrose, 0.7% agar. After one week, cotyledons and petioles of sprouting plants were cut 
out with a scalpel, and transferred to MS medium, 3% sucrose, 0.7% agar, 4.5 \xM. B A, 
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0.05 jiM 2.4-D, 3.3 |iM AgN03, followed by precultivation for 1 day. The precultivated 
cotyledons and petioles were transferred in a 1000-fold dilution of the Agrobacterium 
culture to cause infection for 5 minutes. The infected cotyledons and petioles were 
transferred again to the same medium as used in the precultivation, and cultivated for 3 to 

5 4 days. The cultivated cotyledons and petioles were transferred to MS medium, 3% 
sucrose, 4.5 \iU BA, 0.05 jiM 2.4-D, 3.3 ^iM AgNOs, 500 mg/1 cefotaxim, and 
sterilized with shaking for 1 day. The sterilized cotyledons and petioles were transferred 
to MS medium, 3% sucrose, 0.7% agar, 4.5 ^iM BA, 0.05 \iM 2.4-D, 3.3 fj.M AgNO^, 
100 mg/1 cefotaxim, 20 mg/1 kanamycin, and cultivated for 3 to 4 weeks. The cotyledons 

10 and petioles were transferred to MS medium, 3% sucrose, 0.7% agar, 4.5 \lM BA, 
0.05 |iM 2.4-D, 100 mg/1 cefotaxim, 20 mg/1 kanamycin, and cultivated. The ciiltivation 
on this medium was continued with subculturing at intervals of 3 to 4 weeks. When the 
regeneration of shoots began to occur, these shoots were subcultured on MS medium, 3% 
sucrose, 0.7% agar, 20 mg/1 kanamycin, and cultivated for 3 to 4 weeks. The rooting 

1 5 plants were transferred to vermiculite : peat moss =1:1, and conditioned at 21° to 22°C 
in a cycle of day/night = 12 hours : 12 hours. With the progress of plant body growth, 
the plants were suitably grown with cultivation soil. From leaves of the regenerated 
plants, genomic DNA was extracted according to the method described above, and the 
gene insertion into the plant genome was confirmed by PGR using the primers shown in 

20 list 14 below. 

(List 14) 
35S 30mer 

TTCCAGTATGGACGATTCAAGGCTTGCTTC 

NOS 25mer 

25 ATGTATAATTGCGGGACTCTAATCA 

RS-F 30mer 

AAGAGTGTATCTGAATTTTCACGCGCGGTG 

RS-RV 33mer 

ACCTTCCCATACACCTTTTGGATGAACCTTCAA 
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Example 14 (Transformation of Soybean Somatic Embryo with Broad Bean 
Raffmose Synthase Gene) 

Cultured cells of soybean "Fayette" somatic embryos (400 to 500 mg FW) 
were arranged in one layer within a circle having a diameter of 20 mm on the central part 
5 of a 6 cm agar plate. Two plasmids pBI221-RS and pBI221(-)-RS having chimera genes 
prepared from the broad bean raffmose synthase gene and 35S promoter in Example 12 
were introduced into the soybean somatic embryos according to the disclosure of the 
Japanese Patent Application No. 3-291501/1991. That is, these plasmids were mixed 
with the P-glucuronidase (GUS)/hygromycin-resistant gene (HPT) coexpression vector 
10 pSUM-GH:NotI for selection described in Soshiki Baiyo, 20, 323-327 (1994). These 
mixed plasmids were used for the gene introduction into the soybean somatic embryos 
with a particle gun (800 mg/coating gold particles 200 |ig/shot; projectile stopper-sample 
distance, 100 mm). After the introduction, gyratory cultures were grown in the MS 
modified growth liquid medium (Sigma) containing 25 to 50 [ig/ml hygromycin under 
1 5 illumination at 25 °C for 16 hours, and transformed somatic embryos were selected. 

For the hygromycin-resistant soybean somatic embryos having yellowish 
green color and growth ability, which were selected after about 3 months, polymerization 
chain reaction is effected with primers shown in list 14 above to determine whether the 
broad bean raffinose synthase gene region is amplified or not. This confirms that the 
2 0 broad bean raffmose synthase gene is inserted into the soybean genome. 

Furthermore, the somatic embryos obtained are used for the regeneration of 
plants to give transformant soybean with the broad bean raffinose synthase gene. 
Medium Composition 

LB and MS media used in the above Examples have the following respective 
25 compositions. 

(LB medium) 
Bacto-tryptone 10 g 

Bacto-yeast extract 5 g 
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NaCl 



10 g 



/lliterH2O(pH7.0) 



(MS medium) 



KNO3 
NH4NO3 
NH4CI 
KH2PO4 
MgS04 • 7H2O 
CaCl2 • 2H2O 
MnS04 • 4H2O 
ZnS04 • 7H2O 
CUSO4 • 5H2O 
KI 

C0CI2 • 6H2O 
H3BO3 

NaMo04 • 2H2O 
FeS04 • 7H2O 
Na2EDTA 
Nicotinic acid 
Thiamine HCl 
Pyridoxine HCl 
Inositol 
Glycine 



2022 mg/1 
1650 mg/1 
2140 mg/1 
170 mg/1 
370 mg/1 
440 mg/1 
22.3 mg/1 
8.6 mg/1 
0.025 mg/1 
0.83 mg/1 
0.025 mg/1 
6.2 mg/1 
0.25 mg/1 
27.8 mg/1 
37.3 mg/1 
0.5 mg/1 

1 mg/1 
0.5 mg/1 

100 mg/1 

2 mg/1 



Brief Description of the Sequences 

1. SEQIDNO:l: 

The sequence of SEQ ID N0:1 shows an amino acid sequence of a raffinose 
synthase protein encoded in the raffinose synthase gene obtained from broad bean. 

2. SEQIDNO:2: 

The sequence of SEQ ID NO:2 shows a cDNA nucleotide sequence of the 
raffinose synthase gene obtained from broad bean. 

3. SEQIDNO:3: 

The sequence of SEQ ID NO:3 shows an amino acid sequence of a raffinose 
synthase protein encoded in the raffinose synthase gene obtained from soybean. 
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4. SEQIDNO:4: 

The sequence of SEQ ED N0:4 shows a cDNA nucleotide sequence of the 
raffinose synthase gene obtained from soybean. 

5. SEQIDNO:5: 

5 The sequence of SEQ ID N0:5 shows an amino acid sequence of a raffinose 

synthase protein encoded in the raffinose synthase gene obtained from Japanese artichoke. 

6. SEQIDN0:6: 

The sequence of SEQ ID N0:6 shows a cDNA nucleotide sequence of the 
raffinose synthase gene obtained from Japanese artichoke. 
10 7. SEQIDNO:7: 

The sequence of SEQ ID NO:7 shows an amino acid sequence of a raffinose 
synthase protein encoded in the raffinose synthase gene obtained from com. 

8. SEQIDN0:8: 

The sequence of SEQ ID NO: 8 shows a cDNA nucleotide sequence of the 
1 5 raffinose synthase gene obtained from com. 

9. Listl: 

The nucleotide sequences shown in list 1 are of the typical primers used in the 
amplification of a cDNA fragment of a raffinose synthase gene. All of these sequences 
are based on the nucleotide sequence in the non-coding region of the gene. Primer 1 is a 

20 sense primer corresponding to the 5 '-terminus of a cDNA fragment of the broad bean- 
derived raffinose synthase gene. Primers 2 and 3 are antisense primers corresponding to 
the 3 '-terminus of the cDNA fragment of the broad bean-derived raffinose synthase gene. 
Primer 4 is a sense primer corresponding to the 5 '-terminus of a cDNA fragment of the 
soybean-derived raffinose synthase gene. Primers 5 and 6 are antisense primers 

25 corresponding to the 3'-terrainus of the cDNA fragment of the soybean-derived raffinose 
synthase gene. Depending upon the purpose, recognition sequences for suitable 
restriction endonucleases can be added to the 5 '-termini of these nucleotide sequences in 
an appropriate manner. 



10. List 2: 

The nucleotide sequences shown in list 2 are of the typical primers used in the 
amplification of an open reading frame coding for the anndno acid sequence of a raffinose 
synthase protein in the cDNA sequence of a raffinose synthase gene. Primers 1 and 2 are 
sense primers corresponding to the N-terminus of the broad bean-derived raffinose 
synthase protein. Primers 3 and 4 are antisense primers corresponding to the C-terminus 
of the broad bean-derived raffinose synthase protein. Primers 5 and 6 are sense primers 
corresponding to the N-terminus of the soybean-derived raffinose synthase protein. 
Primers 7 and 8 are antisense primers corresponding to the C-terminus of the soybean- 
derived raffinose synthase protein. Depending upon the purpose, recognition sequences 
for suitable restriction endonucleases can be added to the 5 '-termini of these sequences in 
an appropriate manner. 

11. Lists: 

The amino acid sequences shown in list 3 are partial amino acid sequences of 
a raffinose synthase protein. 

#1 is equivalent to the partial amino acid sequence extending from amino acid 
1 10 to amino acid 129 in the amino acid sequence of SEQ ID NO: 1. 

#2 is equivalent to the partial amino acid sequence extending from amino acid 
234 to amino acid 247 in the amino acid sequence of SEQ ED NO:l. 

#3 is equivalent to the partial amino acid sequence extending from amino acid 
265 to amino acid 279 in the amino acid sequence of SEQ ID NO:l. 

#4 is equivalent to the partial amino acid sequence extending from amino acid 
296 to amino acid 312 in the amino acid sequence of SEQ ID NO:l. 

#5 is equivalent to the partial amino acid sequence extending from amino acid 
346 to amino acid 361 in the amino acid sequence of SEQ ID N0:1. 

#6 is equivalent to the partial amino acid sequence extending from amino acid 
383 to amino acid 402 in the amino acid sequence of SEQ ID NO:l. 

#7 is equivalent to the partial amino acid sequence extending from amino acid 
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41 1 to amino acid 433 in the amino acid sequence of SEQ ID NO:l. 

#8 is equivalent to the partial amino acid sequence extending from amino acid 
440 to amino acid 453 in the amino acid sequence of SEQ ID N0:1. 

#9 is equivalent to the partial amino acid sequence extending from amino acid 
5 457 to amino acid 468 in the amino acid sequence of SEQ ID NO:l. 

#10 is equivalent to the partial amino acid sequence extending from amino 
acid 471 to amino acid 516 in the amino acid sequence of SEQ ID NO:l. 

#1 1 is equivalent to the partial amino acid sequence extending from amino 
acid 517 to amino acid 559 in the amino acid sequence of SEQ ID N0:1. 
10 #12 is equivalent to the partial amino acid sequence extending from amino 

acid 574 to amino acid 582 in the amino acid sequence of SEQ ID NO:l . 

#13 is equivalent to the partial amino acid sequence extending from amino 
acid 586 to amino acid 609 in the amino acid sequence of SEQ ID N0:1. 

#14 is equivalent to the partial amino acid sequence extending from amino 
15 acid 6 1 5 to amino acid 627 in the amino acid sequence of SEQ ID NO : 1 . 

#15 is equivalent to the partial amino acid sequence extending from amino 
acid 716 to amino acid 724 in the amino acid sequence of SEQ ID NO: 1. 

12. List 4: 

The nucleotide sequences shown in list 4 are of the typical primers synthe- 
20 sized on some of the amino acid sequences shown in list 3. The symbol "F' as used after 
the primer number means that the primer referred to by this symbol has a sense sequence. 
The symbol "RV" as used after the primer number means that the primer referred to by 
this symbol has an antisense sequence. Primer 1 corresponds to the partial amino acid 
sequence extending from amino acid 1 19 to amino acid 129 in the amino acid sequence of 
25 SEQ ID NO:l. Primer 2 corresponds to the partial amino acid sequence extending from 
amino acid 234 to amino acid 247 in the amino acid sequence of SEQ ID NO:l. Primer 3 
corresponds to the partial amino acid sequence extending from amino acid 265 to amino 
acid 279 in the amino acid sequence of SEQ ID NO:l. Primer 4 corresponds to the partial 
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amino acid sequence extending from amino acid 458 to amino acid 468 in the amino acid 
sequence of SEQ ID NO:l. Primer 5 corresponds to the partial amino acid sequence 
extending from amino acid 522 to amino acid 534 in the amino acid sequence of SEQ ID 
NO: 1 . Primer 6 corresponds to the partial amino acid sequence extending from amino 
acid 7 1 6 to amino acid 724 in the amino acid sequence of SEQ ID NO: 1 . 

13. List 5: 

The nucleotide sequences shown in list 5 are of the typical primers synthe- 
sized on the partial amino acid sequences of the purified broad bean raffinose synthase 
protein. The bases shown in parentheses mean that a mixture of those bases was used in 
the synthesis. The symbol "RV" as used after the primer number means that the primer 
referred to by this symbol has an antisense sequence. 

14. List 6: 

The nucleotide sequences shown in list 6 are of the typical primers used in the 
analysis of both terminal regions of a cDNA nucleotide sequence of the broad bean 
raffinose synthase gene by the RACE method. The symbol "RV" as used after the primer 
number means that the primer referred to by this symbol has an antisense sequence. 

15. List 7: 

The nucleotide sequences shown in list 7 are of the typical primers used in the 
cloning of the broad bean raffinose synthase gene. RS-N corresponds to the N-terminus 
of the open reading frame and contains two recognition sites for the restriction 
endonucleases Bam HI and Nco I on the 5 '-terminal side. RS-C is an antisense primer 
corresponding to the C-terminus of the open reading frame and contains a recognition site 
for the restriction endonuclease Xba I on the 5'-terminal side. 

16. Lists: 

The nucleotide sequences shown in list 8 are of the typical primers used in the 
cloning of a soybean raffinose synthase gene fragment. The base represented by the 
symbol "I" was inosine used in the synthesis. The symbol "RV" as used after the primer 
number means that the primer referred to by this symbol has an antisense sequence. 



The nucleotide sequences shown in list 9 are of the typical primers used in the 
analysis of the cDNA nucleotide sequence of a soybean raffinose synthase gene fragment. 
The symbol "RV" as used after the primer number means that the primer referred to by 
5 this symbol has an antisense sequence. 

The analysis of nucleotide sequences was carried out by polymerase chain 
reaction using SN-1 and SC-3RV. SC-5 and SC-6 were used in the analysis of a 
nucleotide sequence in the 3'-terminal region, and SN-3RV and SN-4RV were used in the 
analysis of a nucleotide sequence in the 5' -terminal region. 
10 18. List 10: 

The nucleotide sequences shown in list 10 are of the typical primers used in 
the analysis of the cDNA nucleotide sequence of a Japanese artichoke raffmose synthase 
gene fragment. The base represented by the symbol 'T' was inosine used in the synthe- 
sis. The symbol "RV" as used after the primer number means that the primer referred to 
15 by this symbol has an antisense sequence. 

19. List 11: 

The nucleotide sequences shown in list 11 are of the typical primers used in 
the analysis of the cDNA nucleotide sequence of a com raffinose synthase gene fragment. 
The base represented by the symbol "I" was inosine used in the synthesis. The symbol 
20 "RV" as used after tfie primer number means that the primer referred to by this symbol has 
an antisense sequence. 

20. List 12: 

The nucleotide sequences shown in list 12 are of the typical primers used in 
the analysis of the cDNA nucleotide sequence of a com raffinose synthase gene fragment. 
25 M-10 and M-11 were used in the analysis of a nucleotide sequence in the 3 '-terminal 
region. 

21. List 13: 

The nucleotide sequences shown in list 13 are of the typical adopters used in 
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the construction of vectors for antisense experiments. These synthetic DNA fragments 
takes a double-stranded form when mixed together because they are complementary 
strands. This double-stranded DNA fragment has cohesive ends of cleavage sites for the 
restriction endonucleases Bam HI and Sac I on both termini, and contains the restriction 
5 sites for the restriction endonucleases Bam HI and Sac I in the double-stranded region. 
22. List 14: 

The nucleotide sequences shown in list 14 are of the typical primers used in 
the PCR experiment to confirm the gene introduction into the genome of a recombinant 
plant. 35S is a primer toward the downstream region at the 35S promoter site, and NOS 
10 is a primer toward the upstream region at the NOS terminator site. RS-F is a sense primer 
of the broad bean raffinose synthase gene, and RS-RV is an antisense primer of the broad 
bean raffinose synthase gene. 
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SEQUENCE LISTING 



GENERAL INFORMATION: 

(i) APPLICANT: Eijiro, Watanabe 
Ken j i , Oeda 

(ii) TITLE OF INVENTION: RAFFINOSE SYNTHASE GENES AND THEIR USE 
(iii) NUMBER OF SEQUENCES: 8 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Birch, Stewart, Kolasch & Birch, LLP 

(B) STREET: 8110 Gatehouse Road, Suite 500 East 

(C) CITY: Falls Church 

(D) STATE: Virginia 

(E) COUNTRY: USA 

(F) ZIP: 22042 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: C12N 9/00, C12N 15/52 

(vii) PRIOR APPLICATION DATE: 

(A) APPLICATION NUMBER: JP-33 8 673 /199 6 

(B) FILING DATE: 18-12-1996 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NTiME: 

(B) REGISTRATION NUMBER: 

(C) REFERENCE /DOCKET NUMBER: 

(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: (703)205-8000 
{B> TELEFAX: (703)205-8050 



) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 799 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: broad bean (Vicia faba) 

(B) STRAIN: Nintoku Is sun 
(F) TISSUE TYPE: seeds 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 



Met 


Ala 


Pro 


Pro 


Ser 


He 


Thr 


Lys 


Thr 


Ala 


Thr 


Leu 


Gin 


Asp 


Val 


He 


1 








5 










10 










15 




Ser 


Thr 


He 


Asp 


He 


Gly Asn 


Gly 


Asn 


Ser 


Pro 


Leu 


Phe 


Ser 


He 


Thr 








20 










25 










30 






Leu 


Asp 


Gin 


Ser 


Arg 


Asp 


Phe 


Leu 


Ala 


Asn Gly 


His 


Pro 


Phe 


Leu 


Thr 






35 










40 










45 








Gin Val 


Pro 


Pro 


Asn 


He 


Thr 


Thr 


Thr 


Thr 


Thr 


Thr 


Thr 


Ala 


Ser 


Ser 




50 










55 










60 










Phe 


Leu 


Asn 


Leu 


Lys 


Ser 


Asn 


Lys 


Asp 


Thr 


He 


Pro 


Asn 


Asn 


Asn 


Asn 


65 










70 










75 










80 


Thr 


Met 


Leu 


Leu 


Gin 
85 


Gin 


Gly 


Cys 


Phe 


Val 
90 


Gly 


Phe 


Asn 


Ser 


Thr 
95 


Glu 


Pro 


Lys 


Ser 


His 


His 


Val 


Val 


Pro 


Leu 


Gly Lys 


Leu 


Lys 


Gly 


He 


Lys 








100 










105 










110 






Phe 


Met 


Ser 
115 


He 


Phe 


Arg 


Phe 


Lys 
120 


Val 


Trp 


Trp 


Thr 


Thr 
125 


His 


Trp 


Val 


Gly 


Thr 


Asn 


Gly Gin 


Glu 


Leu 


Gin 


His 


Glu 


Thr 


Gin 


Met 


Leu 


He 


Leu 




130 










135 










140 










Asp 


Lys 


Asn 


Asp 


Ser 


Leu 


Gly 


Arg 


Pro 


Tyr 


Val 


Leu 


Leu 


Leu 


Pro 


He 


145 










150 










155 










160 


Leu 


Glu 


Asn 


Thr 


Phe 
165 




Thr 


Ser 


Leu 


Gin 
170 


Pro 


Gly 


Leu 


Asn 


Asp 
175 


His 


lie Gly Met 


Ser 


Val 


Glu 


Ser 


Gly 


Ser 


Thr 


His 


Val 


Thr 


Gly 


Ser 


Ser 








180 










185 










190 






Phe 


Lys 


Ala 
195 


Cys 


Leu 


Tyr 


He 


His 
200 


Leu 


Ser 


Asn 


Asp 


Pro 
205 


Tyr 


Ser 


He 


Leu 


Lys 
210 


Glu 


Ala 


Val 


Lys 


Val 
215 


He 


Gin 


Thr 


Gin 


Leu 
220 


Gly 


Thr 


Phe 


Lys 


Thr 


Leu 


Glu 


Glu 


Lys 


Thr 


Ala 


Pro 


Ser 


He 


He 


Asp 


Lys 


Phe 


Gly 


Trp 


225 










230 










235 










240 


Cys 


Thr 


Trp 


Asp 


Ala 
245 


Phe 


Tyr 


Leu 


Lys 


Val 
250 


His 


Pro 


Lys 


Gly 


Val 
255 


Trp 


Glu 


Gly Val 


Lys 


Ser 


Leu 


Thr 


Asp 


Gly 


Gly Cys 


Pro 


Pro 


Gly 


Phe 


Val 








250 










265 










270 






He 


He 


Asp 
275 


Asp 


Gly 


Trp 


Gin 


Ser 
280 


He 


Cys 


His 


Asp 


Asp 
285 


Asp 


Asp 


Glu 


Asp 


Asp 


Ser 


Gly Met 


Asn 


Arg 


Thr 


Ser 


Ala 


Gly 


Glu 


Gin 


Met 


Pro 


Cys 




290 










295 










300 










Arg 


Leu 


Val 


Lys 


Tyr 


Glu 


Glu 


Asn 


Ser 


Lys 


Phe 


Arg 


Glu 


Tyr 


Glu 


Asn 


305 










310 










315 












Pro 


Glu 


Asn 


Gly Gly Lys 


Lys 


Gly 


Leu 


Gly Gly 


Phe 


Val 


Arg 


Asp 


Leu 


320 








325 










330 










335 




Lys 


Glu 


Glu 


Phe 
340 


Gly 


Ser 


Val 


Glu 


Ser 
345 


Val 


Tyr 


Val 


Trp 


His 
350 


Ala 


Leu 


Cys 


Gly 


Tyr 


Trp 


Gly Gly Val 


Arg 


Pro 


Gly Val 


His 


Gly 


Met 


Pro 


Lys 






355 










360 










365 








Ala 


Arg 


Val 


Val 


Val 


Pro 


Lys 


Val 


Ser 


Gin Gly 


Leu 


Lys 


Met 


Thr 


Met 



370 375 380 
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Glu Asp Leu Ala Val Asp Lys He Val Glu Asn Gly Val Gly Leu Val 
385 390 395 400 

Pro Pro Asp Phe Ala His Glu Met Phe Asp Gly Leu His Ser His Leu 

405 410 415 

Glu Ser Ala Gly He Asp Gly Val Lys Val Asp Val He His Leu Leu 

420 425 430 

Glu Leu Leu Ser Glu Glu Tyr Gly Gly Arg Val Glu Leu Ala Arg Ala 

435 440 445 

Tyr Tyr Lys Ala Leu Thr Ser Ser Val Lys Lys His Phe Lys Gly Asn 

450 455 460 

Gly Val He Ala Ser Met Glu His Cys Asn Asp Phe Phe Leu Leu Gly 
465 470 475 480 

Thr Glu Ala He Ser Leu Gly Arg Val Gly Asp Asp Phe Trp Cys Ser 

485 490 495 

Asp Pro Ser Gly Asp Pro Asn Gly Thr Tyr Trp Leu Gin Gly Cys His 

500 505 510 

Met Val His Cys Ala Tyr Asn Ser Leu Trp Met Gly Asn Phe He Gin 

515 520 525 

Pro Asp Trp Asp Met Phe Gin Ser Thr His Pro Cys Ala Glu Phe His 

530 535 540 

Ala Ala Ser Arg Ala He Ser Gly Gly Pro He Tyr Val Ser Asp Cys 
545 550 555 560 

Val Gly Asn His Asn Phe Lys Leu Leu Lys Ser Leu Val Leu Pro Asp 

565 570 575 

Gly Ser He Leu Arg Cys Gin His Tyr Ala Leu Pro Thr Arg Asp Cys 

580 585 590 

Leu Phe Glu Asp Pro Leu His Asn Gly Lys Thr Met Leu Lys He Trp 

595 600 605 

Asn Leu Asn Lys Tyr Thr Gly Val Leu Gly Leu Phe Asn Cys Gin Gly 

610 615 620 

Gly Gly Trp Cys Pro Glu Ala Arg Arg Asn Lys Ser Val Ser Glu Phe 
625 630 635 640 

Ser Arg Ala Val Thr Cys Tyr Ala Ser Pro Glu Asp He Glu Trp Cys 

645 650 655 

Asn Gly Lys Thr Pro Met Ser Thr Lys Gly Val Asp Phe Phe Ala Val 

660 665 670 

Tyr Phe Phe Lys Glu Lys Lys Leu Arg Leu Met Lys Cys Ser Asp Arg 

675 680 685 

Leu Lys Val Ser Leu Glu Pro Phe Ser Phe Glu Leu Met Thr Val Ser 

690 695 700 

Pro Val Lys Val Phe Ser Lys Arg Phe He Gin Phe Ala Pro He Gly 
705 710 715 720 

Leu Val Asn Met Leu Asn Ser Gly Gly Ala He Gin Ser Leu Glu Phe 

725 730 735 

Asp Asp Asn Ala Ser Leu Val Lys He Gly Val Arg Gly Cys Gly Glu 

740 745 750 

Met Ser Val Phe Ala Ser Glu Lys Pro Val Cys Cys Lys He Asp Gly 

755 760 765 

Val Lys Val Lys Phe Leu Tyr Glu Asp Lys Met Ala Arg Val Gin He 

770 775 780 

Leu Trp Pro Ser Ser Ser Thr Leu Ser Leu Val Gin Phe Leu Phe Stop 
785 790 795 800 



(2) INFORMATION FOR SEQ ID NO: 2: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2745 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA to mENA 

(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 101 to 2500 

(C) IDENTIFICATION METHOD: by experiment 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

AATTTTCAAG CATAGCCAAG TTAACCACCT TAGAAACATT CCTACAAGCT ACTTATCCCT 60 
GTCAATAAGC TACTAAGCTA CCAGAGTCTC ATCAATCACC ATG GCA CCA CCA AGC 115 

Met Ala Pro Pro Ser 
5 

ATA ACC AAA ACT GCA ACC CTC CAA GAC GTA ATA AGC ACC ATC GAT ATT 163 
lie Thr Lys Thr Ala Thr Leu Gin Asp Val lie Ser Thr lie Asp lie 

10 15 20 

GGT AAT GGT AAC TCA CCC TTA TTC TCC ATA ACC TTA GAC CAA TCA CGT 211 
Gly Asn Gly Asn Ser Pro Leu Phe Ser lie Thr Leu Asp Gin Ser Arg 

25 30 35 

GAC TTC CTT GCA AAT GGC CAC CCT TTC CTC ACC CAA GTC CCA CCT AAC 259 
Asp Phe Leu Ala Asn Gly His Pro Phe Leu Thr Gin Val Pro Pro Asn 

40 45 50 

ATA ACA ACA AC A ACA ACA ACC ACT GCT TCC TCT TTT CTC AAT CTC AAA 307 
lie Thr Thr Thr Thr Thr Thr Thr Ala Ser Ser Phe Leu Asn Leu Lys 

55 60 65 

TCC AAC AAA GAT ACC ATT CCC AAC AAC AAC AAC ACC ATG TTG TTG CAA 355 
Ser Asn Lys Asp Thr lie Pro Asn Asn Asn Asn Thr Met Leu Leu Gin 
70 75 80 85 

CAA GGT TGT TTC GTT GGT TTC AAC TCC ACC GAA CCC AAA AGC CAC CAC 403 
Gin Gly Cys Phe Val Gly Phe Asn Ser Thr Glu Pro Lys Ser His His 

90 95 100 

GTA GTT CCA CTC GGC AAA CTA AAA GGA ATC AAA TTC ATG AGC ATA TTC 451 
Val Val Pro Leu Gly Lys Leu Lys Gly lie Lys Phe Met Ser lie Phe 

105 110 115 

CGG TTC AAA GTT TGG TGG ACA ACT CAC TGG GTC GGA ACA AAT GGA CAG 49 9 

Arg Phe Lys Val Trp Trp Thr Thr His Trp Val Gly Thr Asn Gly Gin 

120 125 130 

GAA CTA CAA CAC GAA ACA CAA ATG TTA ATC CTG GAC AAA AAC GAC TCC 547 
Glu Leu Gin His Glu Thr Gin Met Leu lie Leu Asp Lys Asn Asp Ser 

135 140 145 

CTC GGA CGA CCC TAT GTC TTA CTC CTC CCA ATC CTA GAA AAC ACC TTC 595 
Leu Gly Arg Pro Tyr Val Leu Leu Leu Pro lie Leu Glu Asn Thr Phe 
150 155 160 165 

CGA ACC TCA CTC CAA CCC GGT CTC AAC GAT CAC ATA GGC ATG TCC GTC 643 
Arg Thr Ser Leu Gin Pro Gly Leu Asn Asp His lie Gly Met Ser Val 
170 175 180 
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GAA AGC GGT TCA ACA CAT GTC ACC GGG TCA AGC TTC AAA GCA TGT CTT 
Glu Ser Gly Ser Thr His Val Thr Gly Ser Ser Phe Lys Ala Cys Leu 

185 190 195 

TAG ATC CAT CTC ACT AAC GAC CCA TAC AGT ATA CTA AAA GAA GCA GTT 
Tyr lie His Leu Ser Asn Asp Pro Tyr Ser He Leu Lys Glu Ala Val 

200 205 210 

AAA GTA ATC CAA ACT CAG TTA GGA ACA TTC AAG ACT CTT GAA GAA AAA 
Lys Val He Gin Thr Gin Leu Gly Thr Phe Lys Thr Leu Glu Glu Lys 

215 220 225 

ACA GCA CCT AGT ATT ATA GAC AAA TTC GGT TGG TGC ACG TGG GAT GCT 
Thr Ala Pro Ser He He Asp Lys Phe Gly Trp Cys Thr Trp Asp Ala 
230 235 240 245 

TTT TAC TTG AAG GTT CAT CCA AAA GGT GTA TGG GAA GGT GTA AAG TCT 
Phe Tyr Leu Lys Val His Pro Lys Gly Val Trp Glu Gly Val Lys Ser 

250 255 260 

CTC ACA GAT GGT GGT TGT CCT CCC GGT TTC GTC ATA ATC GAC GAC GGT 
Leu Thr Asp Gly Gly Cys Pro Pro Gly Phe Val He He Asp Asp Gly 

265 270 275 

TGG CAA TCC ATT TGT CAT GAC GAT GAC GAT GAA GAT GAT TCA GGA ATG 
Trp Gin Ser He Cys His Asp Asp Asp Asp Glu Asp Asp Ser Gly Met 

280 285 290 

AAC CGA ACC TCA GCC GGG GAA CAA ATG CCA TGC AGA CTT GTA AAA TAC 
Asn Arg Thr Ser Ala Gly Glu Gin Met Pro Cys Arg Leu Val Lys Tyr 

295 300 305 

GAA GAG AAT TCT AAG TTT AGA GAA TAT GAG AAT CCT GAA AAT GGA GGG 
Glu Glu Asn Ser Lys Phe Arg Glu Tyr Glu Asn Pro Glu Asn Gly Gly 
310 315 320 325 

AAG AAA GGT TTG GGT GGT TTT GTG AGG GAT TTG AAG GAA GAG TTT GGG 
Lys Lys Gly Leu Gly Gly Phe Val Arg Asp Leu Lys Glu Glu Phe Gly 

330 335 340 

AGT GTG GAG AGT GTT TAT GTT TGG CAT GCG CTT TGT GGG TAT TGG GGC 
Ser Val Glu Ser Val Tyr Val Trp His Ala Leu Cys Gly Tyr Trp Gly 

345 350 355 

GGG GTT AGG CCT GGA GTG CAT GGG ATG CCG AAA GCT AGG GTT GTT GTT 
Gly Val Arg Pro Gly Val His Gly Met Pro Lys Ala Arg Val Val Val 

360 365 370 

CCG AAG GTG TCT CAG GGG TTG AAG ATG ACG ATG GAG GAT TTG GCG GTG 
Pro Lys Val Ser Gin Gly Leu Lys Met Thr Met Glu Asp Leu Ala Val 

375 380 385 

GAT AAG ATT GTT GAG AAC GGT GTG GGG CTA GTG CCG CCA GAT TTT GCA 
Asp Lys He Val Glu Asn Gly Val Gly Leu Val Pro Pro Asp Phe Ala 
390 395 400 405 

CAT GAG ATG TTT GAT GGG CTT CAC TCT CAT TTG GAG TCG GCG GGA ATT 
His Glu Met Phe Asp Gly Leu His Ser His Leu Glu Ser Ala Gly He 

410 415 420 

GAC GGT GTT AAA GTT GAT GTT ATC CAT CTG CTT GAG TTA CTA TCA GAG 
Asp Gly Val Lys Val Asp Val He His Leu Leu Glu Leu Leu Ser Glu 

425 430 435 

GAA TAT GGT GGA CGA GTT GAG CTA GCA AGA GCT TAT TAC AAA GCA CTA 
Glu Tyr Gly Gly Arg Val Glu Leu Ala Arg Ala Tyr Tyr Lys Ala Leu 

440 445 450 

ACC TCA TCA GTG AAG AAA CAT TTC AAA GGC AAT GGT GTA ATT GCT AGC 
Thr Ser Ser Val Lys Lys His Phe Lys Gly Asn Gly Val He Ala Ser 
455 460 465 
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ATG GAG CAT TGC AAC GAC TTC TTT CTC CTC GGC ACC GAA GCC ATA TCC 
Met Glu His Cys Asn Asp Phe Phe Leu Leu Gly Thr Glu Ala lie Ser 
470 475 480 485 

CTC GGC CGC GTC GGA GAT GAT TTT TGG TGC TCT GAT CCA TCT GGT GAT 
Leu Gly Arg Val Gly Asp Asp Phe Trp Cys Ser Asp Pro Ser Gly Asp 

490 495 500 

CCA AAT GGT ACA TAT TGG CTC CAA GGT TGT CAC ATG GTA CAT TGT GCC 
Pro Asn Gly Thr Tyr Trp Leu Gin Gly Cys His Met Val His Cys Ala 

505 510 515 

TAC AAC AGT TTA TGG ATG GGA AAT TTC ATT CAG CCA GAT TGG GAC ATG 
Tyr Asn Ser Leu Trp Met Gly Asn Phe lie Gin Pro Asp Trp Asp Met 

520 525 530 

TTT CAG TCC ACT CAT CCT TGT GCT GAA TTT CAT GCC GCC TCA CGA GCC 
Phe Gin Ser Thr His Pro Cys Ala Glu Phe His Ala Ala Ser Arg Ala 

535 540 545 

ATA TCC GGC GGA CCA ATT TAT GTT AGT GAT TGT GTT GGT AAT CAC AAT 
lie Ser Gly Gly Pro lie Tyr Val Ser Asp Cys Val Gly Asn His Asn 
550 555 560 565 

TTC AAG TTG CTC AAA TCT CTT GTT TTG CCC GAT GGT TCT ATC TTG CGT 
Phe Lys Leu Leu Lys Ser Leu Val Leu Pro Asp Gly Ser lie Leu Arg 

570 575 580 

TGT CAA CAT TAC GCA CTC CCT ACA AGA GAT TGC TTG TTT GAA GAC CCT 
Cys Gin His Tyr Ala Leu Pro Thr Arg Asp Cys Leu Phe Glu Asp Pro 

585 590 595 

TTG CAT AAT GGC AAA ACA ATG CTG AAA ATT TGG AAT CTC AAC AAA TAT 
Leu His Asn Gly Lys Thr Met Leu Lys lie Trp Asn Leu Asn Lys Tyr 

600 605 610 

ACA GGT GTT TTG GGT CTT TTC AAC TGC CAA GGT GGT GGG TGG TGT CCT 
Thr Gly Val Leu Gly Leu Phe Asn Cys Gin Gly Gly Gly Trp Cys Pro 

615 620 625 

GAG GCA CGG CGA AAC AAG AGT GTA TCT GAA TTT TCA CGC GCG GTG ACA 
Glu Ala Arg Arg Asn Lys Ser Val Ser Glu Phe Ser Arg Ala Val Thr 
630 635 640 645 

TGT TAT GCA AGT CCC GAA GAC ATT GAA TGG TGC AAT GGG AAA ACT CCA 
Cys Tyr Ala Ser Pro Glu Asp He Glu Trp Cys Asn Gly Lys Thr Pro 

650 655 660 

ATG AGC ACC AAA GGT GTG GAT TTT TTT GCT GTG TAT TTT TTC AAG GAG 

Met Ser Thr Lys Gly Val Asp Phe Phe Ala Val Tyr Phe Phe Lys Glu 

665 670 675 

AAG AAA TTG AGG CTC ATG AAG TGT TCT GAT AGA TTG AAA GTT TCG CTT 
Lys Lys Leu Arg Leu Met Lys Cys Ser Asp Arg Leu Lys Val Ser Leu 

680 685 690 

GAG CCA TTT AGT TTT GAG CTA ATG ACA GTG TCT CCA GTG AAA GTG TTT 
Glu Pro Phe Ser Phe Glu Leu Met Thr Val Ser Pro Val Lys Val Phe 

695 700 705 

TCG AAA AGG TTT ATA CAG TTT GCA CCG ATT GGG TTA GTG AAC ATG CTG 
Ser Lys Arg Phe He Gin Phe Ala Pro He Gly Leu Val Asn Met Leu 
710 715 720 725 

AAC TCT GGT GGT GCG ATT CAG TCT CTG GAG TTT GAT GAT AAT GCA AGT 
Asn Ser Gly Gly Ala He Gin Ser Leu Glu Phe Asp Asp Asn Ala Ser 

730 735 740 

TTG GTC AAG ATT GGG GTG AGA GGT TGC GGG GAG ATG AGC GTG TTT GCG 
Leu Val Lys He Gly Val Arg Gly Cys Gly Glu Met Ser Val Phe Ala 
745 750 755 
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TCT GAG AAA CCG GTT TGC TGC AAA ATT GAT GGG GTT AAG GTG AAA TTT 2419 
Ser Glu Lys Pro Val Cys Cys Lys lie Asp Gly Val Lys Val Lys Phe 

760 765 770 

CTT TAT GAG GAG AAA ATG GCA AGA GTT CAA ATT CTG TGG CCT AGT TCT 2467 
Leu Tyr Glu Asp Lys Met Ala Arg Val Gin lie Leu Trp Pro Ser Ser 

775 780 785 

TCA ACA TTG TCT TTG GTC CAG TTT TTA TTT TGA TCCCTAGGAA TCCTATGCAC 2 52 0 
Ser Thr Leu Ser Leu Val Gin Phe Leu Phe Stop 
790 795 800 

GTGTCTCTGT TTACAAGTAC TTTATATAAG TATAATATGT ATCTATTTCC ATTTTTAACT 2580 
GTCTTTATGC AATTAGGTGG TCAATTAGTT ATTTGTTTGT GAAGTAACTA ACTTGCTTGT 2 640 
GTTGTAAGCT TATAATATAT GGTCAAGTTC CTCACTTGTA TATACCTGTT GTATGTATAA 2700 
ATTTTACTAT ATATGACTAA CATCATTATC TTGTGAGCAA AAAAAA 2746 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 781 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: soybean (Glycine max) 

(B) STRAIN: Williams 82 

(F) TISSUE TYPE: seeds and leaves 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



Met 


Ala 


Pro 


Ser 


He 
5 


Ser 


Lys 


Thr 


Val 


Glu 
10 


Leu 


Asn 


Ser Phe 


Gly Leu 
15 


Val 


Asn 


Gly 


Asn 
20 


Leu 


Pro 


Leu 


Ser 


He 
25 


Thr 


Leu 


Glu 


Gly Ser 
30 


Asn Phe 


Leu 


Ala 


Asn 
35 


Gly 


His 


Pro 


Phe 


Leu 
40 


Thr 


Glu 


Val 


Pro 


Glu Asn 
45 


He He 


Val 


Thr 


Pro 


Ser 


Pro 


He 


Asp 


Ala 


Lys 


Ser 


Ser 


Lys Asn Asn Glu Asp 




50 










55 










60 






Asp 


Asp 


Val 


Val 


Gly 


Cys 


Phe 


Val 


Gly 


Phe 


His 


Ala Asp Glu 


Pro Arg 


65 










70 










75 






80 


Ser 


Arg 


His 


Val 


Ala 


Ser 


Leu 


Gly 


Lys 


Leu Arg 


Gly 


He Lys 


Phe Met 










85 










90 








95 


Ser 


lie 


Phe 


Arg 
100 


Phe 


Lys 


Val 


Trp 


Trp 
105 


Thr 


Thr 


His 


Trp Val 
110 


Gly Ser 


Asn 


Gly 


His 


Glu 


Leu 


Glu 


His 


Glu 


Thr 


Gin 


Met 


Met 


Leu Leu 


Asp Lys 






115 










120 










12 5 


Asn 


Asp 
130 


Gin 


Leu 


Gly 


Arg 


Pro 
135 


Phe 


Val 


Leu 


He 


Leu 
140 


Pro He 


Leu Gin 


Ala 


Ser 


Phe 


Arg 


Ala 


Ser 


Leu 


Gin 


Pro 


Gly Leu 


Asp 


Asp Tyr Val Asp 


145 










150 










155 






160 


Val 


Cys 


Met 


Glu 


Ser 
165 


Gly 


Ser 


Thr 


Arg 


Val 
170 


Cys 


Gly 


Ser Ser 


Phe Gly 
175 


Ser 


Cys 


Leu 


Tyr 
180 


Val 


His 


Val 


Gly 


His 
185 


Asp 


Pro 


Tyr 


Gin Leu 
190 


Leu Arg 
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Glu 


Ala 


Thr 


Lys 


Val 


Val 


Arg 


Met 


His 


Leu Gly 


Thr 


Phe 


Lys 


Leu 


Leu 






195 










200 










205 








Glu 


Glu 


Lys 


Thr 


Ala 


Pro 


Val 


He 


He 


Asp 


Lys 


Phe 


Gly 


Trp 


Cys 


Thr 




210 










215 










220 










Trp 


Asp 


Ala 


Phe 


Tyr 


Leu 


Lys 


Val 


His 


Pro 


Ser 


Gly 


Val 


Trp 


Glu 


Gly 


225 










230 










235 










240 


Val 


Lys 


Gly 


Leu 


Val 


Glu 


Gly 


Gly 


Cys 


Pro 


Pro 


Gly 


Met 


Val 


Leu 


He 










245 










250 










255 




Asp 


Asp 


Gly 


Trp 


Gin 


Ala 


He 


Cys 


His 


Asp 


Glu 


Asp 


Pro 


He 


Thr 


Asp 








260 










265 










270 






Gin 


Glu 


Gly 


Met 


Lys 


Arg 


Thr 


Ser 


Ala 


Gly Glu 


Gin 


Met 


Pro 


Cys 


Arg 






275 










280 










285 








Leu 


Val 


Lys 


Leu 


Glu 


Glu 


Asn 


Tyr 


Lys 


Phe 


Arg 


Gin 


Tyr 


Cys 


Ser 


Gly 




290 










295 










300 










Lys 


Asp 


Ser 


Glu 


Lys 


Gly 


Met 


Gly 


Ala 


Phe 


Val 


Arg 


Asp 


Leu 


Lys 


Glu 


305 










310 










315 










320 


Gin 


Pile 


Arg 


Ser 


Val 


Glu 


Gin 


Val 


Tyr 


Val 


Trp 


His 


Ala 


Leu 


Cys 


Gly 










325 










330 










335 




Tyr 


Trp 


Gly 


Gly 


Val 


Arg 


Pro 


Lys 


Val 


Pro 


Gly 


Met 


Pro 


Gin 


Ala 


Lys 








340 










345 










350 




Val 


Val 


Thr 


Pro 


Lys 


Leu 


Ser 


Asn 


Gly 


Leu 


Lys 


Leu 


Thr 


Met 


Lys 


Asp 






355 










360 










365 








Leu 


Ala 


Val 


Asp 


Lys 


He 


Val 


Ser 


Asn 


Gly Val 


Gly 


Leu 


Val 


Pro 


Pro 




370 










375 










380 










His 


Leu 


Ala 


His 


Leu 


Leu 


Tyr 


Glu 


Gly 


Leu 


His 


Ser 


Arg 


Leu 


Glu 


Ser 


385 










390 










395 










400 


Ala 


Gly 


He 


Asp 


Gly 


Val 


Lys 


Val 


Asp 


Val 


He 


His 


Leu 


Leu 


Glu 


Met 










405 










410 










415 




Leu 


Ser 


Glu 


Glu 


Tyr 


Gly 


Gly 


Arg 


Val 


Glu 


Leu 


Ala 


Lys 


Ala 


Tyr 


Tyr 








420 










425 










430 






Lys 


Ala 


Leu 


Thr 


Ala 


Ser 


Val 


Lys 


Lys 


His 


Phe 


Lys 


Gly 


Asn 


Gly Val 






435 










440 










445 








lie 


Ala 


Ser 


Met 


Glu 


His 


Cys 


Asn 


Asp 


Phe 


Phe 


Leu 


Leu 


Gly 


Thr 


Glu 




450 










455 










460 










Ala 


He 


Ala 


Leu 


Gly 


Arg 


Val 


Gly 


Asp 


Asp 


Phe 


Trp 


Cys 


Thr 


Asp 


Pro 


465 










470 










475 










480 


Ser 


Gly 


Asp 


Pro 


Asn 


Gly 


Thr 


Tyr 


Trp 


Leu 


Gin 


Gly 


Cys 


His 


Met 


Val 










485 










490 










495 




His 


Cys 


Ala 


Tyr 


Asn 


Ser 


Leu 


Trp 


Met 


Gly Asn 


Phe 


He 


Gin 


Pro 


Asp 








500 










505 










510 






Trp 


Asp 


Met 


Phe 


Gin 


Ser 


Thr 


His 


Pro 


Cys 


Ala 


Glu 


Phe 


His 


Ala 


Ala 






515 










520 










525 








Ser 


Arg 


Ala 


He 


Ser 


Gly 


Gly 


Pro 


Val 


Tyr 


Val 


Ser 


Asp 


Cys 


Val 


Gly 




530 










535 










540 








Lys 


His 


Asn 


Phe 


Lys 


Leu 


Leu 


Lys 


Ser 


Leu 


Ala 


Leu 


Pro 


Asp 


Gly Thr 


545 










550 










555 










560 


lie 


Leu 


Arg 


Cys 


Gin 


His 


Tyr 


Ala 


Leu 


Pro 


Thr 


Arg 


Asp 


Cys 


Leu 


Phe 










565 










570 










575 




Glu 


Asp 


Pro 


Leu 


His 


Asp 


Gly 


Lys 


Thr 


Met 


Leu 


Lys 


He 


Trp 


Asn 


Leu 








580 










585 










590 






Asn 


Lys 


Tyr 


Thr 


Gly 


Val 


Leu 


Gly 


Leu 


Phe 


Asn 


Cys 


Gin 


Gly 


Gly Gly 






595 










600 










605 








Trp 


Cys 


Pro 


Val 


Thr 


Arg 


Arg 


Asn 


Lys 


Ser Ala 


Ser 


Glu 


Phe 


Ser 


Gin 




610 










615 










620 
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Thr Val Thr Cys Leu 
625 

Lys Ser Pro lie Cys 
645 

Phe Lys Asp His Lys 
660 

Val Ser Leu Glu Pro 
675 

lie Val Leu Ser Lys 
690 

Asn Met Leu Asn Thr 
705 

His lie Asp Val Val 
725 

Val Phe Ala Ser Glu 
740 

Val Lys Phe Asp Tyr 
755 

Pro Ser Ala Ser Lys 
770 



Ala Ser Pro Gin Asp lie 
630 635 
He Lys Gly Met Asn Val 
650 

Leu Lys Leu Met Lys Ala 
665 

Phe Thr Phe Glu Leu Leu 
680 

Lys Leu He Gin Phe Ala 
695 

Gly Gly Ala He Gin Ser 
710 715 
Lys He Gly Val Arg Gly 
730 

Lys Pro Val Ser Cys Lys 
745 

Glu Asp Lys Met Leu Arg 
760 

Leu Ser Met Val Glu Phe 
775 



Glu Trp Ser Asn Gly 
640 

Phe Ala Val Tyr Leu 
655 

Ser Glu Lys Leu Glu 
670 

Thr Val Ser Pro Val 
685 

Pro He Gly Leu Val 
700 

Met Glu Phe Asp Asn 
720 

Cys Gly Glu Met Lys 
735 

Leu Asp Gly Val Val 
750 

Val Gin Val Pro Trp 
765 

Leu Phe Stop 
780 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2598 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA to rciRNA 

(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 62 to 2407 

(C) IDENTIFICATION METHOD: by experiment 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

CCAAACCATA GCAAACCTAA GCACCAAACC TCTTTCTTTC AAGATCCTTG AATTCAGTCC 60 
C ATG GCT CCA AGC ATA AGC AAA ACT GTG GAA CTA AAT TCA TTT GGT 106 
Met Ala Pro Ser He Ser Lys Thr Val Glu Leu Asn Ser Phe Gly 
5 10 15 

CTT GTC AAC GGT AAT TTG CCT TTG TCC ATA ACC CTA GAA GGA TCA AAT 154 
Leu Val Asn Gly Asn Leu Pro Leu Ser He Thr Leu Glu Gly Ser Asn 

20 25 30 

TTC CTC GCC AAC GGC CAC CCT TTT CTC ACG GAA GTT CCC GAA AAC ATA 2 02 

Phe Leu Ala Asn Gly His Pro Phe Leu Thr Glu Val Pro Glu Asn He 

35 40 45 

ATA GTC ACC CCT TCA CCC ATC GAC GCC AAG AGT AGT AAG AAC AAC GAG 250 
He Val Thr Pro Ser Pro He Asp Ala Lys Ser Ser Lys Asn Asn Glu 

50 55 60 

GAC GAC GAC GTC GTA GGT TGC TTC GTG GGC TTC CAC GCG GAC GAG CCC 298 
Asp Asp Asp Val Val Gly Cys Phe Val Gly Phe His Ala Asp Glu Pro 
65 70 75 
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AGA 


AGC 


CGA 


CAC 


GTG 


GOT 


TCG 


CTG 


GGG 


AAG 


CTC 


AGA 


GGA 


ATA 


AAA 


TTC 


346 


Arg 


Ser 


Arg 


His 


Val 


Ala 


Ser 


Leu 


Gly Lys 


Leu 


Arg Gly He 


Lys 


Phe 




80 










85 










90 










95 




ATG 


AGC 


ATA 


TTC 


CGG 


TTT 


AAG 


GTG 


TGG 


TGG 


ACC 


ACT 


CAC 


TGG 


GTC 


GGT 


394 


Met 


Ser 


He 


Phe 


Arg 


Phe 


Lys 


Val 


Trp 


Trp 


Thr 


Thr 


His 


Trp Val 


Gly 












100 










105 










110 




AGC 


AAC 


GGA 


CAC 


GAA 


CTG 


GAG 


CAC 


GAG 


ACA 


CAG 


ATG 


ATG 


CTT 


CTC 


GAC 


442 


Ser 


Asn 


Gly 


His 
115 


Glu 


Leu 


Glu 


His 


Glu 
120 


Thr 


Gin 


Met 


Met 


125 




Asp 




AAA 


AAC 


GAC 


CAG 


CTC 


GGA 


CGC 


CCC 


TTT 


GTG 


TTG 


ATT 


CTC 


CCG 


ATC 


CTC 


490 


Lys 


Asn 


Asp 


Gin 


Leu 


Gly Arg 


Pro 


Phe 


Val 


Leu 


He 




Pro 


He 


Leu 








130 










135 










140 










CAA 


GCC 


TCG 


TTC 


CGA 


GCC 


TCC 


CTG 


CAA 


CCC 


GGT 


TTG 


GAT 


GAT 


TAC 


GTG 


538 


Gin 


Ala 
145 


Ser 


Phe 


Arg 


Ala 


Ser 
150 


Leu 


Gin 


Pro 


Gly 


155 








Val 




GAC 


GTT 


TGC 


ATG 


GAG 


AGC 


GGG 


TCG 


ACA 


CGT 


GTC 


TGT 


GGC 


TCC 


AGC 


TTC 


586 


Asp 


Val 


Cys 


Met 


Glu 


Ser 


Gly 


Ser 


Thr 


Arg 


Val 


Cys 


Gly 


Ser 


Ser 


Phe 




160 










165 










170 










175 




GGG 


AGC 


TGC 


TTA 


TAC 


GTC 


CAC 


GTT 


GGC 


CAT 


GAC 


CCG 


TAT 


CAG 


TTG 


CTT 


634 


Gly 


Ser 


Cys 


Leu 


Tyr 


Val 


His 


Val 


Gly His 


Asp 


Pro 


Tyr 


Gin 


Leu 


Leu 












180 










185 










190 






AGA 


GAA 


GCA 


ACT 


AAA 


GTC 


GTT 


AGG 


ATG 


CAT 


TTG 


GGG 


ACG 


TTC 


AAG 


CTT 


682 


Arg 


Glu 


Ala 


Thr 


Lys 


Val 


Val 


Arg 


Met 


His 


Leu 


Gly Thr 


Phe 


Lys 


Leu 










195 










200 










205 








CTC 


GAG 


GAG 


AAA 


ACC 


GCG 


CCA 


GTG 


ATC 


ATA 


GAC 


AAG 


TTT 


GGT 


TGG 


TGT 


730 


Leu 


Glu 


Glu 


Lys 


Thr 


Ala 


Pro 


Val 


He 


He 


Asp 


Lys 


Phe Gly Trp Cys 








210 










215 










220 










ACA 


TGG 


GAC 


GCG 


TTT 


TAC 


TTG 


AAG 


GTG 


CAT 


CCC 


TCA 


GGT 


GTG 


TGG 


GAA 


778 


Thr 


Trp 


Asp 


Ala 


Phe 


Tyr 


Leu 


Lys 


Val 


His 


Pro 


Ser 


Gly Val 


Trp 


Glu 






225 










230 










235 












GGG 


GTG 


AAA 


GGG 


TTG 


GTG 


GAG 


GGA 


GGG 


TGC 


CCT 


CCA 


GGG 


ATG 


GTC 


CTA 


826 


Gly Val 


Lys 


Gly 


Leu 


Val 


Glu 


Gly 


Gly Cys 


Pro 


Pro 


Gly Met Val 


Leu 




240 










245 










250 










255 




ATC 


GAC 


GAC 


GGG 


TGG 


CAA 


GCC 


ATT 


TGT 


CAC 


GAC 


GAG 


GAC 


CCC 


ATA 


ACG 


874 


He 


Asp 


Asp 


Gly 


Trp 
260 


Gin 


Ala 


He 


Cys 


His 
265 


Asp 


Glu 


Asp 


Pro 


He 
270 


Thr 




GAC 


CAA 


GAG 


GGT 


ATG 


AAG 


CGA 


ACC 


TCC 


GCA 


GGG 


GAG 


CAA 


ATG 


CCA 


TGC 


922 


Asp 


Gin 


Glu 


Gly 


Met 


Lys 


Arg 


Thr 


Ser 


Ala 


Gly Glu 


Gin 


Met 


Pro 


Cys 










275 










280 










285 






AGG 


TTG 


GTG 


AAG 


TTG 


GAG 


GAA 


AAT 


TAC 


AAG 


TTC 


AGA 


CAG 


TAT 


TGT 


AGT 


970 


Arg 


Leu 


Val 


Lys 


Leu 


Glu 


Glu 


Asn 


Tyr 


Lys 


Phe 


Arg 


Gin 


Tyr 


Cys 


Ser 








290 










295 










300 








GGA 


AAG 


GAT 


TCT 


GAG 


AAG 


GGT 


ATG 


GGT 


GCC 


TTT 


GTT 


AGG 


GAC 


TTG 


AAG 


1018 


Gly 


Lys 


Asp 


Ser 


Glu 


Lys Gly Met Gly Ala Phe Val Arg Asp Leu Lys 






305 










310 










315 












GAA 


CAG 


TTT 


AGG 


AGC 


GTG 


GAG 


CAG 


GTG 


TAT 


GTG 


TGG 


CAC 


GCG 


CTT 


TGT 


1066 


Glu 


Gin 


Phe 


Arg 


Ser 


Val 


Glu 


Gin 


Val 


Tyr Val 


Trp 


His 


Ala 


Leu 


Cys 


320 










325 










330 










335 




GGG 


TAT 


TGG 


GGT 


GGG 


GTC 


AGA 


CCC 


AAG 


GTT 


CCG 


GGC 


ATG 


CCC 


CAG 


GCT 


1114 


Gly 


Tyr 


Trp 


Gly 


Gly Val 


Arg 


Pro 


Lys 


Val 


Pro 


Gly Met 


Pro 


Gin 


Ala 












340 










345 










350 






AAG 


GTT 


GTC 


ACT 


CCG 


AAG 


CTG 


TCC 


AAT 


GGA 


CTA 


AAA 


TTG 


ACA 


ATG 


AAG 


1162 


Lys 


Val 


Val 


Thr 


Pro 


Lys 


Leu 


Ser 


Asn 


Gly Leu 


Lys 


Leu 


Thr 


Met 


Lys 










355 










360 










365 







58 



GAT 


TTA 


GCG 


GTG 


GAT 


AAG 


ATC GTC 


AGT AAC 


GGA 


GTT 


GGA 


CTG 


GTG 


CCA 


1210 


Asp 


Leu 


Ala 


Val 


Asp 


Lys 


He Val 


Ser Asn 


Gly Val 


Gly Leu Val 


Pro 








370 








375 








380 










CCA 


CAC 


CTG 


GCT 


CAC 


CTT 


TTG TAC 


GAG GGG 


CTC 


CAC 


TCC 


CGT 


TTG 


GAA 


12 58 


Pro 


His 
385 


Leu 


Ala 


His 


Leu 


Leu Tyr 
390 


Glu Gly 


Leu 


His 
395 


Ser 


Arg 


Leu 


Glu 




TCT 


GCG 


GGT 


ATT 


GAC 


GGT 


GTT AAG 


GTT GAC 


GTT 


ATA 


CAC 


TTG 


CTC 


GAG 


1306 


Ser 


Ala 


Gly 


He 


Asp 


Gly Val Lys 


Val Asp 


Val 


He 


His 


Leu 


Leu 


Glu 




400 










405 






410 










415 




ATG 


CTA 


TCC 


GAG 


GAA 


TAC 


GGT GGC 


CGT GTT 


GAG 


CTA 


GCC 


AAA 


GCT 


TAT 


1354 


Met 


Leu 


Ser 


Glu 


Glu 


Tyr Gly Gly Arg Val 


Glu 


Leu 


Ala 


Lys 


Ala 


Tyr 












420 






425 










43 0 




TAC 


AAA 


GCG 


CTC 


ACT 


GCT 


TCG GTG 


AAG AAG 


CAT 


TTC 


AAA 


GGC 


AAT 


GGG 


1402 


Tyr 


Lys 


Ala 


Leu 


Thr 


Ala 


Ser Val 


Lys Lys 


His 


Phe 


Lys 


Gly Asn 


Gly 










435 








440 








445 








GTC 


ATT 


GCG 


AGC 


ATG 


GAG 


CAT TGT 


AAT GAC 


TTC 


TTT 


CTC 


CTT 


GGT 


ACC 


1450 


Val 


He 


Ala 


Ser 


Met 


Glu 


His Cys 


Asn Asp 


Phe 


Phe 


Leu 


Leu 


Gly 


Thr 








450 








455 








460 








GAA 


GCC 


ATA 


GCC 


CTT 


GGG 


CGC GTA 


GGA GAT 


GAT 


TTT 


TGG 


TGC 


ACT 


GAT 


1498 


Glu 


Ala 


He 


Ala 


Leu 


Gly Arg Val 


Gly Asp 


Asp 


Phe 


Trp 


Cys 


Thr 


Asp 






465 










470 






475 










CCC 


TCT 


GGA 


GAT 


CCA 


AAT 


GGC ACG 


TAT TGG 


CTC 


CAA 


GGG 


TGT 


CAC 


ATG 


1546 


Pro 


Ser 


Gly Asp Pro Asn Gly Thr Tyr Trp 


Leu Gin Gly Cys His 


Met 




480 










485 






490 










495 




GTG 


CAC 


TGT 


GCC 


TAC 


AAC 


AGC TTG 


TGG ATG 


GGG 


AAT 


TTT 


ATT 


CAG 


CCG 


1594 


Val 


His 


Cys 


Ala 


Tyr 


Asn 


Ser Leu 


Trp Met 


Gly Asn 


Phe 


He 


Gin 


Pro 












500 




505 








510 




GAT 


TGG 


GAC 


ATG 


TTC 


CAG 


TCC ACT 


CAC CCT 


TGT 


GCC 


GAA 


TTC 


CAT 


GO 


1642 


Asp 


Trp 


Asp 


Met 
515 


Phe 


Gin 


Ser Thr 


His Pro 
52 0 


Cys 


Ala 


Glu 


Phe 
525 


His 


Ala 




GCC 


TCT 


AGG 


GCC 


ATC 


TCT 


GGT GGA 


CCA GTT 


TAC 


GTT 


AGT 


GAT 


TGT 


GTT 


1690 


Ala 


Ser 


Arg 


Ala 


He 


Ser 


Gly Gly 


Pro Val 


Tyr 


Val 


Ser 


Asp 


Cys 


Val 








530 








535 








540 








GGA 


AAG 


CAC 


AAC 


TTC 


AAG 


TTG CTC 


AAG AGC 


CTC 


GCT 


TTG 


CCT 


GAT 


GGG 


1738 


Gly 


Lys 


His 


Asn 


Phe 


Lys 


Leu Leu 


Lys Ser 


Leu 


Ala 


Leu 


Pro 


Asp Gly 






545 










550 






555 












ACG 


ATT 


TTG 


CGT 


TGT 


CAA 


CAC TAT 


GCA CTC 


CCC 


ACA 


CGA 


GAC 


TGT 


TTG 


1786 


Thr 


He 


Leu 


Arg 


Cys 


Gin 


His Tyr 


Ala Leu 


Pro 


Thr Arg Asp Cys Leu 




560 










565 






570 










575 




TTT 


GAA 


GAC 


CCC 


TTG 


CAT 


GAT GGG 


AAG ACA 


ATG 


CTC 


AAA 


ATT 


TGG 


AAT 


1834 


Phe 


Glu 


Asp 


Pro 


Leu 


His 


Asp Gly Lys Thr Met 


Leu 


Lys 


He 


Trp 


Asn 












580 






585 










590 






CTC 


AAC 


AAA 


TAT 


ACA 


GGT 


GTT TTG 


GGT CTA 


TTT 


AAT 


TGC 


CAA 


GGA 


GGT 


1882 


Leu 


Asn 


Lys 


Tyr 


Thr 


Gly Val Leu 


Gly Leu 


Phe 


Asn 


Cys 


Gin 


Gly 


Gly 








595 








600 








605 




GGG 


TGG 


TGT 


CCC 


GTA 


ACT 


AGG AGA 


AAC AAG 


AGT 


GCC 


TCT 


GAA 


TTT 


TCA 


1930 


Gly 


Trp 


Cys 
610 


Pro 


Val 


Thr 


Arg Arg 
615 


Asn Lys 


Ser 


Ala 


Ser 
62 0 


Glu 


Phe 


Ser 




ACT 


GTG 


ACA 


TGC 


TTA 


GCG AGT 


CCT CAA 


GAC 


ATT 


GAA 


TGG 


AGC 


AAT 


1978 


Gin 


Thr 


Val 


Thr 


Cys 


Leu 


Ala Ser 


Pro Gin 


Asp 


He 


Glu 


Trp 


Ser 


Asn 




525 










630 






635 










GGG 


AAA 


AGC 


CCA 


ATA 


TGC 


ATA AAA 


GGG ATG 


AAT 


GTG 


TTT 


GCT 


GTA 


TAT 


2026 


Gly 


Lys 


Ser 


Pro 


He 


Cys 


He Lys 


Gly Met Asn Val 


Phe 


Ala 


Val 


Tyr 


640 










645 






650 










655 





59 



TTG TTC AAG GAG CAC AAA CTA AAG CTC ATG AAG GCA TCA GAG AAA TTG 2074 
Leu Phe Lys Asp His Lys Leu Lys Leu Met Lys Ala Ser Glu Lys Leu 

660 665 670 

GAA GTT TCA CTT GAG CCA TTT ACT TTT GAG CTA TTG ACA GTG TCT CCA 2122 
Glu Val Ser Leu Glu Pro Phe Thr Phe Glu Leu Leu Thr Val Ser Pro 

675 680 685 

GTG ATT GTG CTG TCA AAA AAG TTA ATT CAA TTT GCT CCA ATT GGA TTA 2170 
Val lie Val Leu Ser Lys Lys Leu lie Gin Phe Ala Pro lie Gly Leu 

690 695 700 

GTG AAC ATG CTT AAC ACT GGT GGT GCC ATT CAG TCC ATG GAG TTT GAC 2218 
Val Asn Met Leu Asn Thr Gly Gly Ala lie Gin Ser Met Glu Phe Asp 

705 710 715 

AAC CAC ATA GAT GTG GTC AAA ATT GGG GTT AGG GGT TGT GGG GAG ATG 2266 
Asn His lie Asp Val Val Lys lie Gly Val Arg Gly Cys Gly Glu Met 
720 725 730 735 

AAG GTG TTT GCA TCA GAG AAA CCA GTT AGT TGC AAA CTA GAT GGG GTA 2314 
Lys Val Phe Ala Ser Glu Lys Pro Val Ser Cys Lys Leu Asp Gly Val 

740 745 750 

GTT GTA AAA TTT GAT TAT GAG GAT AAA ATG CTG AGA GTG CAA GTT CCC 23 62 
Val Val Lys Phe Asp Tyr Glu Asp Lys Met Leu Arg Val Gin Val Pro 

755 760 765 

TGG CCT AGT GCT TCA AAA TTG TCA ATG GTT GAG TTT TTA TTT TGA TCCCT 2412 
Trp Pro Ser Ala Ser Lys Leu Ser Met Val Glu Phe Leu Phe Stop 

770 775 780 

GAAGGTGAAT TTGGGATACT ATGATGTTTG ACTCTCTTTT TAAGTAATAA GAGTCATATT 2472 
TTTCTGTTGT AAAAAAAAAA AAAAAA 2498 



(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 587 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Japanese artichoke (Stachys sieboldii) 
(F) TISSUE TYPE: leaves 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 



Thr Asn Gly Ser Asp 
1 5 
Lys Ser Asp Asp Arg 
20 

Gin Phe Arg Ala Ser 
35 

Cys Val Glu Ser Gly 
50 

Ser Leu Tyr Met His 
65 

Ala Val Lys Val Ala 
85 



Leu Glu Arg Glu Thr 
10 

Pro Tyr lie Val Leu 
25 

Leu Gin Pro Gly Val 
40 

Ser Thr Lys Val Asn 
55 

Ala Gly Asp Asp Pro 
70 

Arg His His Leu Gly 
90 



Gin He Val Val Leu Asp 
15 

Leu Pro Leu He Glu Gly 
30 

Asp Asp Phe He Asp He 
45 

Glu Ser Ser Phe Arg Ala 
60 

Phe Thr Leu Val Lys Asp 
75 80 
Thr Phe Arg Leu Leu Glu 
95 



60 



Glu Lys Thr Pro Pro Gly He Val Asp Lys Phe Gly Trp Cys Thr Trp 

100 105 110 

Asp Ala Phe Tyr Leu Asn Val Gin Pro His Gly Val Met Glu Gly Val 

115 120 125 

Gin Gly Leu Val Asp Gly Gly Cys Pro Pro Gly Leu Val Leu He Asp 

130 135 140 

Asp Gly Trp Gin Ser He Cys His Asp Asn Asp Ala Leu Thr Thr Glu 
145 150 155 160 

Gly Met Gly Arg Thr Ser Ala Gly Glu Gin Met Pro Cys Arg Leu He 

165 170 175 

Lys Phe Glu Glu Asn Tyr Lys Phe Arg Glu Tyr Glu Ser Pro Asn Lys 

180 185 190 

Thr Gly Pro Gly Pro Asn Thr Gly Met Gly Ala Phe He Arg Asp Met 

195 200 205 

Lys Asp Asn Phe Lys Ser Val Asp Tyr Val Tyr Val Trp His Ala Leu 

210 215 220 

Cys Gly Tyr Trp Gly Gly Leu Arg Pro Asn Val Pro Gly Leu Pro Glu 
225 230 235 240 

Ala Lys Leu He Glu Pro Lys Leu Thr Pro Gly Leu Lys Thr Thr Met 

245 250 255 

Glu Asp Leu Ala Val Asp Lys He Val Asn Asn Gly Val Gly Leu Val 

260 265 270 

Pro Pro Glu Phe Val Glu Gin Met Tyr Glu Gly Leu His Ser His Leu 

275 280 285 

Glu Ser Val Gly He Asp Gly Val Lys Val Asp Val He His Leu Leu 

290 295 300 

Glu Met Leu Cys Glu Asp Tyr Gly Gly Arg Val Asp Leu Ala Lys Ala 
305 310 315 320 

Tyr Tyr Lys Ala Leu Ser Ser Ser Val Asn Asn His Phe Asn Gly Asn 

325 330 335 

Gly Val He Ala Gly Leu Glu His Cys Asn Asp Phe Met Phe Leu Gly 

340 345 350 

Thr Glu Ala He Thr Leu Gly Arg Val Gly Asp Asp Phe Trp Cys Thr 

355 360 365 

Asp Pro Ser Gly Asp Pro Asn Gly Thr Phe Trp Leu Gin Gly Cys His 

370 375 380 

Met Val His Cys Ala Tyr Asn Ser He Trp Met Gly Asn Phe He His 
385 390 395 400 

Pro Asp Trp Asp Met Phe Gin Ser Thr His Pro Cys Ala Glu Phe His 

405 410 415 

Ala Ala Ser Arg Ala He Ser Gly Gly Pro He Tyr Val Ser Asp Ser 

420 425 430 

Val Gly Lys His Asn Phe Glu Leu Leu Arg Ser Leu Val Leu Pro Asp 

435 440 445 

Gly Ser He Leu Arg Cys Asp Tyr Tyr Ala Leu Pro Thr Arg Asp Cys 

450 455 460 

Leu Phe Glu Asp Pro Leu His Asn Gly Lys Thr Met Leu Lys He Trp 
465 470 475 480 

Asn Tyr Asn Lys Phe Thr Gly Val Val Gly Thr Phe Asn Cys Gin Gly 

485 490 495 

Gly Gly Trp Ser Arg Glu Val Arg Arg Asn Gin Cys Ala Ala Glu Tyr 

500 505 510 

Ser His Ala Val Ser Ser Ser Ala Gly Pro Ser Asp He Glu Trp Lys 
515 520 525 



61 



Gin Gly Thr Ser Pro lie Asp Val 

530 535 
Tyr Leu Phe His Glu Lys Lys Leu 
545 550 
lie Asp lie Thr Leu Glu Pro Phe 
565 

Pro Val Lys Thr Leu Ala Asn Cys 
580 



Asp Gly Val Lys Thr Phe Ala Leu 
540 

Val Leu Ser Lys Pro Ser Asp Lys 
555 560 
Asp Phe Glu Leu lie Thr Val Ser 

570 575 
Thr Val Gin 
585 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1762 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to itlRNA 

(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 2 to 1762 

(C) IDENTIFICATION METHOD: by experiment 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

G ACA AAC GGG TCG GAT CTT GAG CGG GAA ACT CAA ATA GTC GTG CTC 
Thr Asn Gly Ser Asp Leu Glu Arg Glu Thr Gin lie Val Val Leu 
15 10 15 

GAC AAG TCC GAC GAC AGG CCC TAG ATC GTG CTG CTT CCG CTC ATC GAG 
Asp Lys Ser Asp Asp Arg Pro Tyr He Val Leu Leu Pro Leu He Glu 

20 25 30 

GGG GAG TTT CGG GOT TCC CTT GAG CCC GGT GTG GAT GAT TTT ATC GAT 
Gly Gin Phe Arg Ala Ser Leu Gin Pro Gly Val Asp Asp Phe He Asp 

35 40 45 

ATT TGT GTC GAA AGC GGG TCA ACC AAG GTC AAC GAG TCC TCG TTC CGT 
He Cys Val Glu Ser Gly Ser Thr Lys Val Asn Glu Ser Ser Phe Arg 

50 55 60 

GCT TCG CTC TAG ATG CAC GCC GGT GAT GAC CCT TTT ACC CTG GTG AAG 
Ala Ser Leu Tyr Met His Ala Gly Asp Asp Pro Phe Thr Leu Val Lys 

65 70 75 

GAC GCC GTG AAG GTG GCG CGC CAC CAC CTC GGG ACG TTC AGG CTG CTG 
Asp Ala Val Lys Val Ala Arg His His Leu Gly Thr Phe Arg Leu Leu 
80 85 90 95 

GAG GAG AAA ACT CCG CCG GGG ATC GTC GAC AAA TTC GGG TGG TGC ACG 
Glu Glu Lys Thr Pro Pro Gly He Val Asp Lys Phe Gly Trp Cys Thr 

100 105 110 

TGG GAT GCG TTC TAG CTC AAC GTC GAG CCC CAC GGG GTT ATG GAG GGG 
Trp Asp Ala Phe Tyr Leu Asn Val Gin Pro His Gly Val Met Glu Gly 

115 120 125 

GTG GAG GGG CTG GTT GAC GGG GGA TGT CCG CCG GGG CTG GTG TTG ATC 
Val Gin Gly Leu Val Asp Gly Gly Cys Pro Pro Gly Leu Val Leu He 
130 135 



62 



GAC GAC GGG TGG CAG TCC ATT TGT CAC GAC AAC GAG GCG CTC ACC ACC 
Asp Asp Gly Trp Gin Ser lie Cys His Asp Asn Asp Ala Leu Thr Thr 

145 150 155 

GAG GGG ATG GGG AGA ACC TCC GCC GGA GAG CAA ATG CCC TGC AGG TTG 
Glu Gly Met Gly Arg Thr Ser Ala Gly Glu Gin Met Pro Cys Arg Leu 
160 165 170 175 

ATC AAG TTT GAG GAG AAT TAG AAG TTC AGG GAG TAG GAG AGC CCG AAT 
lie Lys Phe Glu Glu Asn Tyr Lys Phe Arg Glu Tyr Glu Ser Pro Asn 

180 185 190 

AAA ACT GGG CCG GGC CCG AAT ACG GGG ATG GGG GCC TTT ATT CGT GAC 
Lys Thr Gly Pro Gly Pro Asn Thr Gly Met Gly Ala Phe lie Arg Asp 

195 200 205 

ATG AAG GAC AAT TTC AAG AGT GTG GAC TAG GTG TAG GTG TGG CAT GCG 
Met Lys Asp Asn Phe Lys Ser Val Asp Tyr Val Tyr Val Trp His Ala 

210 215 220 

TTG TGT GGT TAT TGG GGC GGG CTC AGG CCC AAT GTT CCG GGC CTG CCC 
Leu Cys Gly Tyr Trp Gly Gly Leu Arg Pro Asn Val Pro Gly Leu Pro 

225 230 235 

GAG GCT AAG CTC ATT GAG CCC AAA CTG ACT CCT GGG CTT AAG ACC ACC 
Glu Ala Lys Leu lie Glu Pro Lys Leu Thr Pro Gly Leu Lys Thr Thr 
240 245 250 255 

ATG GAA GAT TTG GCT GTT GAT AAG ATT GTC AAC AAT GGC GTG GGT CTG 
Met Glu Asp Leu Ala Val Asp Lys lie Val Asn Asn Gly Val Gly Leu 

260 265 270 

GTC CCA CCG GAG TTT GTT GAA CAA ATG TAT GAA GGA TTA CAT TCA CAT 
Val Pro Pro Glu Phe Val Glu Gin Met Tyr Glu Gly Leu His Ser His 

275 280 285 

CTC GAA TCT GTG GGG ATT GAT GGA GTC AAA GTT GAC GTC ATC CAT TTG 
Leu Glu Ser Val Gly lie Asp Gly Val Lys Val Asp Val He His Leu 

290 295 300 

TTG GAA ATG TTG TGT GAA GAC TAT GGT GGG AGA GTG GAC TTA GCC AAG 
Leu Glu Met Leu Cys Glu Asp Tyr Gly Gly Arg Val Asp Leu Ala Lys 

305 310 315 

GCT TAT TAG AAG GCC TTA TCA AGC TCA GTT AAC AAC CAC TTC AAC GGC 
Ala Tyr Tyr Lys Ala Leu Ser Ser Ser Val Asn Asn His Phe Asn Gly 
320 325 330 335 

AAC GGC GTC ATC GCT GGC CTG GAG CAC TGC AAT GAC TTC ATG TTT CTC 
Asn Gly Val He Ala Gly Leu Glu His Cys Asn Asp Phe Met Phe Leu 

340 345 350 

GGA ACC GAG GCC ATT ACC TTG GGT CGT GTC GGG GAT GAT TTT TGG TGC 
Gly Thr Glu Ala He Thr Leu Gly Arg Val Gly Asp Asp Phe Trp Cys 

355 360 365 

ACT GAT CCA TCT GGA GAT CCC AAT GGC ACG TTC TGG TTG CAA GGG TGT 
Thr Asp Pro Ser Gly Asp Pro Asn Gly Thr Phe Trp Leu Gin Gly Cys 

370 375 380 

CAC ATG GTG CAC TGC GCC TAC AAC AGC ATA TGG ATG GGT AAT TTC ATC 
His Met Val His Cys Ala Tyr Asn Ser He Trp Met Gly Asn Phe He 

385 390 395 

CAC CCT GAT TGG GAC ATG TTT CAA TGG ACT CAC CCT TGC GCT GAA TTC 
His Pro Asp Trp Asp Met Phe Gin Ser Thr His Pro Cys Ala Glu Phe 
400 405 410 415 

CAC GCT GCC TCA CGA GCC ATC TCC GGC GGG CCC ATT TAC GTC AGT GAC 
His Ala Ala Ser Arg Ala He Ser Gly Gly Pro He Tyr Val Ser Asp 
420 425 430 



TCG GTC GGA AAG CAC AAC 

Ser Val Gly Lys His Asn 
435 

GAT GGC TCC ATC CTC CGT 

Asp Gly Ser lie Leu Arg 
450 

TGC CTC TTT GAA GAT CCA 

Cys Leu Phe Glu Asp Pro 
465 

TGG AAT TAT AAC AAG TTC 

Trp Asn Tyr Asn Lys Phe 

480 485 

GGT GGC GGG TGG AGC CGG 

Gly Gly Gly Trp Ser Arg 
500 

TAT TCC CAC GCC GTC TCC 

Tyr Ser His Ala Val Ser 
515 

AAG CAA GGA ACG AGT CCG 

Lys Gin Gly Thr Ser Pro 
530 

TTG TAG CTA TTC CAC GAG 

Leu Tyr Leu Phe His Glu 
545 

AAA ATC GAC ATC ACG CTT 

Lys lie Asp lie Thr Leu 

560 565 

TCT CCA GTC AAA ACT CTA 

Ser Pro Val Lys Thr Leu 
580 



TTC GAG CTC CTT AGG AGC 
Phe Glu Leu Leu Arg Ser 
440 

TGT GAT TAG TAC GCG CTT 
Cys Asp Tyr Tyr Ala Leu 
455 

CTT CAC AAT GGC AAG ACT 
Leu His Asn Gly Lys Thr 
470 475 
ACC GGA GTT GTC GGA ACT 
Thr Gly Val Val Gly Thr 
490 

GAA GTG CGT CGC AAC CAA 
Glu Val Arg Arg Asn Gin 
505 

TCT AGC GCT GGT CCG AGT 
Ser Ser Ala Gly Pro Ser 
520 

ATC GAC GTC GAC GGC GTC 
lie Asp Val Asp Gly Val 
535 

AAG AAA CTC GTC CTT TCT 
Lys Lys Leu Val Leu Ser 
550 555 
GAG CCC TTC GAT TTT GAG 
Glu Pro Phe Asp Phe Glu 
570 

GCC AAT TGC ACC GTC CAA 
Ala Asn Cys Thr Val Gin 
585 



CTC GTT CTT CCC 1342 
Leu Val Leu Pro 
445 

CCG ACT CGC GAT 1390 

Pro Thr Arg Asp 

460 

ATG CTC AAA ATT 1438 
Met Leu Lys lie 

TTC AAC TGC CAA 1486 
Phe Asn Cys Gin 
495 

TGC GCT GCC GAG 1534 
Cys Ala Ala Glu 
510 

GAC ATT GAG TGG 1582 
Asp He Glu Trp 
525 

AAA ACA TTC GCG 163 0 

Lys Thr Phe Ala 

540 

AAG CCA TCA GAC 1678 
Lys Pro Ser Asp 

CTG ATA ACC GTT 172 6 
Leu He Thr Val 
575 

1762 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 271 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: corn (Zea mays L.) 

(B) STRAIN: Pioneer 3358 
(F) TISSUE TYPE: leaves 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

Ala Ser Arg Ala He 
15 

Gly Gin His Asp Phe 
30 

Thr Val Leu Arg Cys 
45 



Gin Ser Thr His Pro Cys Ala Ala Phe His Ala 

5 10 
Ser Gly Gly Pro He Tyr Val Ser Asp Ser Val 

20 25 
Ala Leu Leu Arg Arg Leu Ala Leu Pro Asp Gly 
35 40 



64 



Glu 


Gly 


His 


Ala 


Leu 


Pro 


Thr 


Arg 


Asp 


Cys 


Leu 


Phe 


Ala Asp 


Pro 


Leu 




50 










55 










60 








His 


Asp 


Gly 


Arg 


Thr 


Val 


Leu 


Lys 


He 


Trp Asn 


Val 


Asn Arg 


Phe 


Ala 


65 










70 










75 








80 


Gly 


Val 


Val 


Gly 


Ala 


Phe 


Asn 


Cys 


Gin 


Gly 


Gly Gly 


Trp Ser 


Pro 


Glu 










85 










90 








95 




Ala 


Arg 


Arg 


Asn 


Lys 


Cys 


Phe 


Ser 


Glu 


Phe 


Ser 


Val 


Pro Leu 


Ala 


Ala 








100 










105 








110 






Arg 


Ala 


Ser 


Pro 


Ser 


Asp 


Val 


Glu 


Trp 


Lys 


Ser 


Gly 


Lys Ala 


Gly 


Pro 






115 










120 










125 






Gly 


Val 


Ser 


Val 


Lys 


Asp 


Val 


Ser 


Gin 


Phe 


Ala 


Val 


Tyr Ala 


Val 


Glu 




130 










135 










140 








Ala 


Arg 


Thr 


Leu 


Gin 


Leu 


Leu 


Arg 


Pro 


Asp 


Glu 


Gly 


Val Asp 


Leu 


Thr 


145 










150 










155 








160 


Leu 


Gin 


Pro 


Phe 


Thr 


Tyr 


Glu 


Leu 


Phe 


Val 


Val 


Ala 


Pro Val 


Arg 


Val 










165 










170 








175 




He 


Ser 


His 


Glu 


Arg 


Ala 


He 


Lys 


Phe 


Ala 


Pro 


He 


Gly Leu 


Ala 


Asn 








180 










185 








190 






Met 


Leu 


Asn 


Thr 


Ala 


Gly 


Ala 


Val 


Gin 


Ala 


Phe 


Glu 


Ala Lys 


Lys 


Asp 






195 










200 










205 






Ala 


Ser 


Gly 


Val 


Thr 


Ala 


Glu 


Val 


Phe 


Val 


Lys 


Gly 


Ala Gly 


Glu 


Leu 




210 










215 










220 








Val 


Ala 


Tyr 


Ser 


Ser 


Ala 


Thr 


Pro 


Arg 


Leu 


Cys 


Lys 


Val Asn 


Gly 


Asp 


225 










230 










235 








240 


Glu 


Ala 


Glu 


Phe 


Thr 


Tyr 


Lys 


Asp 


Gly 


Val 


Val 


Thr 


Val Asp 


Val 


Pro 










245 










250 








255 




Trp 


Ser 


Gly 


Ser 


Ser 


Ser 


Lys 


Leu 


Cys 


Cys 


Val 


Gin 


Tyr Val 


Tyr 


Stop 








260 










265 








270 






(2) 


INFORMATION 


FOR 


SEQ 


ID NO: 8: 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA to mRNA 

(ix) FEATURE: 

(A) NAME/KEY: peptide 

(B) LOCATION: 2 to 817 

(C) IDENTIFICATION METHOD: by experiment 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

C CAG TCC ACG CAC CCC TGC GCC GCC TTC CAC GCC GCG TCC CGC GCC 
Gin Ser Thr His Pro Cys Ala Ala Phe His Ala Ala Ser Arg Ala 
5 10 15 

ATC TCC GGC GGG CCC ATC TAG GTC AGC GAG TCG GTG GGG CAG CAC GAC 
He Ser Gly Gly Pro He Tyr Val Ser Asp Ser Val Gly Gin His Asp 
20 25 30 
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TTC GCG CTG CTC CGC CGC CTG GCG CTC CCC GAC GGC ACC GTC CTC CGG 142 
Phe Ala Leu Leu Arg Arg Leu Ala Leu Pro Asp Gly Thr Val Leu A 

35 40 45 

TGC GAG GGC CAC GCG CTG CCC ACG CGC GAC TGC CTC TTC GCC GAC CCG 19 0 

Cys Glu Gly His Ala Leu Pro Thr Arg Asp Cys Leu Phe Ala Asp Pro 

50 55 60 

CTC CAC GAC GGC CGG ACC GTG CTC AAG ATC TGG AAC GTG AAC CGC TTC 23 8 

Leu His Asp Gly Arg Thr Val Leu Lys lie Trp Asn Val Asn Arg Phe 

65 70 75 

GCC GGC GTC GTC GGC GCC TTC AAC TGC CAG GGC GGC GGG TGG AGC CCC 286 
Ala Gly Val Val Gly Ala Phe Asn Cys Gin Gly Gly Gly Trp Ser Pro 
80 85 90 95 

GAG GCG CGG CGG AAC AAG TGC TTC TCG GAG TTC TCC GTG CCC CTG GCC 334 
Glu Ala Arg Arg Asn Lys Cys Phe Ser Glu Phe Ser Val Pro Leu Ala 

100 105 110 

GCG CGC GCC TCG CCG TCC GAC GTC GAG TGG AAG AGC GGC AAG GCG GGG 382 
Ala Arg Ala Ser Pro Ser Asp Val Glu Trp Lys Ser Gly Lys Ala Gly 

115 120 125 

CCA GGC GTC AGC GTC AAG GAC GTC TCC CAG TTC GCC GTG TAG GCG GTC 430 
Pro Gly Val Ser Val Lys Asp Val Ser Gin Phe Ala Val Tyr Ala Val 

130 135 140 

GAG GCC AGG ACG CTG CAG CTG CTG CGC CCC GAC GAG GGC GTC GAC CTC 478 
Glu Ala Arg Thr Leu Gin Leu Leu Arg Pro Asp Glu Gly Val Asp Leu 

145 150 155 

ACG CTG CAG CCC TTC ACC TAC GAG CTC TTC GTC GTT GCC CCC GTG CGC 52 6 

Thr Leu Gin Pro Phe Thr Tyr Glu Leu Phe Val Val Ala Pro Val Arg 
160 165 170 175 

GTC ATC TCG CAT GAG CGG GCC ATC AAG TTC GCG CCC ATC GGA CTC GCC 574 
Val lie Ser His Glu Arg Ala lie Lys Phe Ala Pro lie Gly Leu Ala 

180 185 190 

AAC ATG CTC AAC ACC GCC GGC GCC GTG CAG GCG TTC GAG GCC AAG AAA 622 
Asn Met Leu Asn Thr Ala Gly Ala Val Gin Ala Phe Glu Ala Lys Lys 

195 200 205 

GAT GCT AGC GGC GTC ACG GCA GAG GTG TTC GTG AAG GGC GCA GGG GAG 670 
Asp Ala Ser Gly Val Thr Ala Glu Val Phe Val Lys Gly Ala Gly Glu 

210 215 220 

CTG GTG GCG TAC TCG TCG GCG ACG CCC AGG CTC TGC AAG GTG AAC GGC 718 
Leu Val Ala Tyr Ser Ser Ala Thr Pro Arg Leu Cys Lys Val Asn Gly 

225 230 235 

GAC GAG GCC GAG TTC ACG TAC AAG GAC GGC GTG GTC ACC GTC GAC GTG 766 
Asp Glu Ala Glu Phe Thr Tyr Lys Asp Gly Val Val Thr Val Asp Val 
240 245 250 255 

CCG TGG TCG GGG TCG TCG TCG AAG CTG TGT TGC GTC CAG TAC GTC TAC 814 
Pro Trp Ser Gly Ser Ser Ser Lys Leu Cys Cys Val Gin Tyr Val Tyr 

260 265 270 

TGA GCCGGACGGG CCGATGACTC TGCGTCTCTG CTCCCTGCTG GCCTGCTCAG GAC 873 
Stop 

ATAATCTAAT GTTTAGAGCT TACCAGGTTT TACAGCTCTA TCAGTTTACT TTTGTTTTTC 933 
TGCTCTTCGT TTTTTAAGAA TTATTTCTAT TGTGTGAATT AATGAGTGCT TTCCTTCTAA 993 
AAA 996 
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CLAIMS 



1 . A raffinose synthase gene isolated from a plant and having a nucleotide 
sequence coding for an amino acid sequence of a protein capable of producing raffmose 
by combining a D-galactosyl group through an a(l— >6) bond with a hydroxyl group 
attached to the carbon atom at position 6 of a D-glucose residue in a sucrose molecule. 

2 . The raffinose synthase gene according to claim 1, wherein the plant is a 
dicotyledon. 

3. The raffinose synthase gene according to claim 2, wherein the 
dicotyledon is a leguminous plant 

4. The raffinose synthase gene according to claim 3, wherein the 
leguminous plant is broad bean. 

5 . A raffinose synthase gene having a nucleotide sequence coding for 
protein (a) or (b) as defined below: 

(a) protein having the amino acid sequence of SEQ ID NO:l ; 

(b) protein having an amino acid sequence derived by deletion, replace- 
ment, modification or addition of one or several amino acids in the amino add sequence of 
SEQ ID NO:l, and capable of producing raffinose by combining a D-galactosyl group 
through an a(l-^6) bond with a hydroxyl group attached to the carbon atom at position 6 
of a D-glucose residue in a sucrose molecule. 

6 . A raffinose synthase gene having the nucleotide sequence of SEQ ID 

NO:2. 

7. The raffinose synthase gene according to claim 3, wherein the 
leguminous plant is soybean. 

8 . A raffinose synthase gene having a nucleotide sequence coding for 
protein (a) or (b) as defined below: 

(a) protein having the amino acid sequence of SEQ ID NO:3; 

(b) protein having an amino acid sequence derived by deletion, replace- 
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ment, modification or addition of one or several amino acids in the amino add sequence of 
SEQ ID NO:3, and capable of producing raffinose by combining a D-galactosyl group 
through an a(l->6) bond with a hydroxyl group attached to the carbon atom at position 6 
of a D-glucose residue in a sucrose molecule. 

9. A raffinose synthase gene having the nucleotide sequence of SEQ ID 

NO:4. 

10. The raffinose synthase gene according to claim 2, wherein the 
dicotyledon is a lamiaceous plant. 

1 1 . The raffinose synthase gene according to claim 10, wherein the lamia- 
ceous plant is Japanese artichoke. 

12. A raffinose synthase gene having a nucleotide sequence coding for the 
amino acid sequence of SEQ ID NO:5. 

13. A raffinose synthase gene having the nucleotide sequence of SEQ ID 

NO:6. 

14. The raffinose synthase gene according to claim 1, wherein the plant is a 
monocotyledon. 

15. The raffinose synthase gene according to claim 14, wherein the mono- 
cotyledon is a gramineous plant. 

16. The raffinose synthase gene according to claim 15, wherein the grami- 
neous plant is com. 

17. A raffinose synthase gene having a nucleotide sequence coding for the 
amino acid sequence of SEQ ID NO:7. 

18. A raffinose synthase gene having the nucleotide sequence of SEQ ID 

NO: 8. 

19. A raffinose synthase protein having amino acid sequence (a) or (b) as 
defined below: 

(a) amino acid sequence of SEQ ED NO:l or SEQ ID NO:3; 

(b) amino acid sequence derived by deletion, replacement, modification or 
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addition of one or several amino acids in the amino acid sequence of SEQ ID NO: 1 or 
SEQIDNO:3; 

the protein being capable of producing raffinose by combining a D-galactosyl 
group through an a(l— >6) bond with a hydroxyl group attached to the carbon atom at 
position 6 of a D-glucose residue in a sucrose molecule. 

20. A raffinose synthase protein having the amino acid sequence of SEQ ID 
NO:l or SEQIDNO:3. 

21 . A gene fragment having a partial nucleotide sequence of the raffinose 
synthase gene of claim 1, 2, 3, 4, 7, 10, 11, 14, 15 or 16. 

22. A gene fragment having a partial nucleotide sequence of the raffinose 
synthase gene of claim 5, 6, 8, 9, 12, 13, 17 or 18. 

23 . The gene fragment accordmg to claim 21 or 22, wherein the number of 
nucleotides is in the range of from 15 to 50. 

24. A method for the detection of a raffinose synthase gene olr a-gene 
fragment having a partial nucleotide sequence thereof, which comprises hybridizing a 
probe of the labeled gene fragment of claim 21, 22 or 23 to an organism-derived genomic 
DNA or cDNA fragment; and detecting the DNA fragment bound specifically to the 
probe. 

25. A method for the detection of a raffinose synthase gene or a gene 
fragment having a partial nucleotide sequence thereof, which comprises hybridizing a 
probe of the labeled gene fragment of claim 21, 22 or 23 to a plant-derived genomic DNA 
or cDNA fragment; and detecting the DNA fragment bound specifically to the probe. 

26. A method for the amplification of a raffinose synthase gene or a gene 
fragment having a partial nucleotide sequence thereof, which comprises annealing a 
primer having a nucleotide sequence of the gene fragment of claim 21, 22 or 23 to 
organism-derived genomic DNA or cDNA; and amplifying the resulting DNA fragment 
by polymerase chain reaction. 

27. A method for the amplification of a raffinose synthase gene or a gene 
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fragment having a partial nucleotide sequence thereof, which comprises annealing a 
primer having a nucleotide sequence of the gene fragment of claim 21, 22 or 23 to plant- 
derived genomic DNA or cDNA; and amplifying the resulting DNA fragment by 
polymerase chain reaction. 

28 . A method for obtaining a raffinose synthase gene, comprising the steps 
of identifying a DNA fragment containing a raffinose synthase gene or a gene fragment 
having a partial nucleotide sequence thereof by the method of claim 24, 25, 26 or 27; and 
isolating and purifying the DNA fragment identified. 

29. A raffinose synthase gene obtained by identifying a DNA fragment 
containing a raffinose synthase gene or a gene fragment having a partial nucleotide 
sequence thereof by the method of claim 24, 25, 26 or 27, and isolating and purifying the 
DNA fragment identified. 

30. A chimera gene comprising the raffinose synthase gene of claim 1, 2, 3, 
4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18 or 29, and a promoter Unked thereto. 

31. A transformant obtained by introducing the chimera gene of claim 30 
into a host organism. 

32. Aplasmid comprising the raffinose synthase gene of claim 1,2, 3, 4, 5, 
6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 29 or 30. 

33. A host organism transformed with the plasmid of claim 32, or a cell 
thereof. ' 

34. A microorganism transformed with the plasmid of claim 32. 

35. A plant transformed with the plasmid of claim 32, or a cell thereof. 

36. A method for metabolic modification, which comprises introducing the 
raffinose synthase gene of claim 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 
18, 29 or 30 into a host organism or a cell thereof, so that the content of raffinose family 
oligosaccharides in the host organism or the cell thereof is changed. 

37. A method for the production of a raffinose synthase protein, which 
comprises isolating and purifying a raffinose synthase protein from a culture obtained by 
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cultivating the microorganism of claim 34. 

38. An anti-rafFinose synthase antibody capable of binding to the raffinose 
synthase protein of claim 19 or 20. 

39. A method for the detection of a raffinose synthase protein, which 
comprises treating a test protein with the anti-raffinose synthase antibody of claim 38; and 
detecting the raffinose synthase protein by antigen-antibody reaction between the antibody 
and the raffinose synthase protein. 
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ABSTRACT OF THE DISCLOSURE 



Raffinose synthase genes coding for proteins capable of producing raffinose 
by combining a D-galactosyl group through an a(l-^6) bond with a hydroxyl group 
attached to the carbon atom at position 6 of a D-glucose residue in a sucrose molecule 
were isolated from various plants. These raffinose synthase genes are useful to change 
the content of raffinose family oligosaccharides in plants. 
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Fig. 1 




Fig. 2 




Fig. 3 
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